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204 



Motif Specification 

XXXX(FY)XX(LIMV) 

XXXX(FY)XXX(LIMV) 

XXXXNXXX(LIMV) 

XXXXNXXXX(LIMV) 

X(LM)XXXXXXV 

X(LM)XXXXXXXV 

X(LMVT)XXXXXX(KRY) 

X(LMVT)XXXXXXX(KRY) 

XPXXXXXX(LIMVF) 

XPXXXXXXX(LIMVF) 



206 



FIG.11A 
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MaxInsertions={enter value here} 208 
OutputToScreen=yes/no 210 
OutputToFi 1 e=yes/no 212 
MinimumAccepted={enter value here} 214 
MaxDupl i cateFuncti onVal ues={enter val ue here} 
MaxSearchTime (min. )={enter value here} 218 
Exhaustive=yes/no 220 
NumStochasticProbes={enter value here} 222 
MaxHitsPerProbe={enter value here} 224 
RandomProbeStart=yes/no 226 

FIG.1 "IB 
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FIG. 12 



( End ) 
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Junctional Analyzer run on Saturday, February 26, 2000 09:06:23 pm. 18/90 
The following non-zero AA weights will be used. 
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The following 10 motif specifications will be used to seorch for junctionals. 
Count Motif Specification 

1 XXXX(FY)XX(LIMV) 

2 XXXX(FY)XXX(LIMV) 

3 XXXXNXXX(LIMV) 

4 XXXXNXXXX(LIMV) 

5 X(LM)XXXXXXV 

6 X(LM)XXXXXXXV 

7 X(LMVT)XXXXXX(KRY) 

8 X(LMVT)XXXXXXX(KRY) 

9 XPXXXXXX(LIMVF) 

10 XPXXXXXXX(LIMVF) 



Code 


Peptide 
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Max Insert ions = 4 (208) 



FIG.13A 
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OutputToScreen = No 
OutputToFile = Yes 
MinimumVolueAccepted = 0 
MaxDuplicateFunctionVolues = 50 
SearchTime = 5 
NumStochasticProbes = 10 
MaxHitsPerProbe = 25 
RandomProbeStart = Yes 
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MoxFunc. 



A 


C 


A 


A 


C 




A 


C 




A 


C 




A 


C 




A 


C 




A 


C 




A 


G 




A 


C 


A 


A 


C 




B 


c 


A 


CD 


c 


A 


8 


c 


A 


B 


c 


A 


B 


c 


A 


B 


c 




B 


c 




CD 


c 


A 


B 


c 


A 


B 


c 


A 


C 


c 


A 


C 


c 


A 


C 


c 




C 


c 


A 


C 


c 




C 


c 




C 


c 




C 


c 


A 


C 


c 


A 


C 


c 


A 



A 
A 
A 



A 
A 



8.80 
8.80 
8.80 
8.80 
1.57 
3.14 
6.28 
2.39 
5.32 
6.28 
5.32 
6.28 
6.28 
6.28 
2.66 
3.14 
6.28 
2.66 
5.32 
5.32 
3.14 
3.14 
4.40 
3.14 
3.14 
3.14 
6.28 
3.14 
6.28 
6.28 



FIG.13B 
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E22 60mer polypeptide (- GPGPG spacers) 
^ 75mer polypeptide (+ GPGPG spacer) 
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EP-HIV-1090 

MGMQVQIQSLFLLLLWVPGSRGKLVGKLNWAGAAILKEPVHGVNAACPKVSFEPIKIPIHYCAPAKAKFVAAW 
TLKAMKAFPVRPQVPLGMKLTPLCVTLGAMVUTO 

SDAKNIPYNPQSQGVVKHPVHAGPIANVTVYYGVPVWKKAAAQMAVFIHNFKNAAAYPLASLRSLFNLTFGWC 
FKLNRILQQLLFINAKIQNFRVYYRKAAVTIKIGGQLKKVPLQLPPLKAMTNNPPIPV 

ATGGGAATGCAGGTGCAGATCCAGAGCCTGTTTCTGCTCCTCCTGTGGGTGCCCGGATCCAGAGGAAAGCTGG 
TGGGCAAACTCAACTGGGCCGGAGCTGCAATCCTGAAGGAGCCCGTCCACGGGGTGAATGCCGCTTGCCCTAA 
AGTCAGCTTCGMCCMTTMGATCCCCATTCATTACTGTGCACCTGCCAAAGCTAAGTTTGTGGCCGCTTGG 
ACCCTCAAGGCCGCTGCAAAAGCCTTCCCAGTGAGGCCCCAGGTGCCTCTGGGCGCCGCTAAACTCACACCAC 
TGTGCGTCACTCTGGGAGCCGCTGCAGTGCTGGCAGAGGCCATGTCCCAAGTGAAGGTGTATCTGGCTTGGGT 
GCCCGCCCACAAGGGGGCCGCTGCAGCCATCTTTCAGTCTAGCATGACCAAGAAAACAACTCTGTTCTGTGCC 
TCCGACGCTAAGAACATCCCTTATAATCCACAGTCTCAGGGCGTGGTCAAGCATCCCGTGCACGCCGGACCTA 
TTGCTMCGTGACCGTGTACTATGGGGTCCCAGTGTGGAAGAAAGCCGCTGCACAGATGGCCGTGTTTATTCA 
CMTTTCAAAAACGCCGCTGCATACCCCCTCGCCAGCCTGAGATCCCTCTTCAACCTGACATTCGGCTGGTGC 
TTTAAGCTGAACCGGATCCTGCAGCAACTGCTCTTTATCAATGCTAAAATCCAGAACTTCCGCGTCTACTATA 
GGAAGGCTGCAGTGACTATCAAAATTGGCGGACAACTGAAGAAAGTGCCTCTCCAGCTGCCCCCTCTCAAGGC 
AATGACCAACAATCCCCCTATCCCAGTCTGA 

HIV-CPT 

MGMQVQIQSLFLLLLWVPGSRGIPIHYCAPAKMKIQNFRVYYRKMVTIKIGGQLKKAKFVMWTLKAAAKV 
PLQLPPLKAI FQSSMTKKLTPLCVTLGAQMAVFIHNFKGAKVYLAWVPAHKNAI PYNPQSQGVVKAI LKEPVH 
GVGAAALTFGWCFKLNAVLAEAMSQVNRILQQLLFINAAACPKVSFEPIKVTVYYGVPVWKKAAHPVHAGPIA 
NAAAYPLASLRSLFNAAATTLFCASDAKNKLVGKLNWANAAAFPVRPQVPLNMTNNPPIPV 

ATGGGGATGCAGGTGCAGATCCAGAGCCTGTTTCTGCTCCTCCTGTGGGTGCCCGGATCCAGAGGAATCCCCA 
TTCACTACTGCGCCCCTGCTAAGGCAGCCAAAATCCAGAACTTCAGGGTGTATTACAGAAAGGCTGCAGTCAC 
CATTAAMTCGGCGGACMCTGMGAMGCCMGTTTGTGGCCGCTTGGACACTCAAGGCCGCTGCAAAGGTC 
CCACTGCAGCTCCCCCCTCTGAAGGCCATCTTCCAGAGCTCCATGACTAAGAAACTGACCCCACTGTGTGTGA 
CACTCGGGGCCCAGATGGCTGTGTTCATCCATAATTTTAAAGGCGCCAAGGTCTACCTGGCTTGGGTGCCCGC 
ACACAAGAACGCCATTCCTTACAATCCACAGTCTCAAGGAGTGGTCAAAGCTATTCTGAAGGAGCCCGTGCAC 
GGGGTGGGCGCCGCTGCACTCACTTTCGGATGGTGCTTTAAACTGAACGCCGTGCTGGCTGAAGCCATGAGCC 
AGGTCAATCGGATCCTGCAGCAACTGCTCTTCATTAACGCCGCTGCATGTCCTAAGGTGTCCTTCGAGCCAAT 
CAAAGTGACCGTGTATTACGGGGTCCCCGTGTGGAAGAAAGCCGCTCATCCTGTCCACGCAGGCCCAATCGCC 
MCGCCGCTGCATATCCCCTCGCCTCTCTGCGCAGCCTGmMCGCCGCTGCMCMCCCTCTTTTGCGCCT 
CCGACGCTAAGAATAAACTGGTGGGAAAGCTGAACTGGGCCAACGCAGCTGCATTCCCTGTGAGGCCACAGGT 
CCCCCTCAATATGACTAACAATCCCCCTATCCCAGTGTGA 



FIG.18A 
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HIV-FT 

MQVQIQSLFLLLLWVPGSRGKLVGKLNWAMASDFNLPPVAIFQSSMTKVTIKIGGQLKRILQQLLFIMAVFIH 
NFKIPYNPQSQGVVTTLFCASDAKILKEPVHGVQMAVFIHNFKGAAVFIHNFKRCPKVSFEPIKIQNFRVYYR 
LTFGWCFKLQVPLRPMTYKMTNNPPIPVTVYYGVPVWKVLAEAMSQVIPIHYCAPAKLTPLCVTL 

ATGCAGGTGCAGATCCAGAGCCTGTTTCTGCTCCTCCTGTGGGTGCCCGGATCCAGAGGAAAGCTGGTGGGGA 
AGCTGAACTGGGCCATGGCCAGCGATTTCAACCTGCCCCCCGTGGCCATCTTCCAGAGCAGCATGACCAAGGT 
GACCATCAAGATCGGGGGGCAGCTGAAGAGGATCCTGCAGCAGCTGCTGTTCATCATGGCCGTGTTCATCCAC 
AACTTCAAGATCCCCTACAACCCCCAGAGCCAGGGGGTGGTGACCACCCTGTTCTGCGCCAGCGATGCCAAGA 
TCCTGAAGGAGCCCGTGCACGGGGTGCAGATGGCCGTGTTCATCCACAACTTCAAGGGCGCCGCCGTGTTCAT 
CCACAACTTCAAGAGGTGCCCCAAGGTGAGCTTCGAGCCCATCAAGATCCAGAACTTCAGGGTGTACTACAGG 
CTGACCTTCGGGTGGTGCTTCAAGCTGCAGGTGCCCCTGAGGCCCATGACCTACAAGATGACCAACAACCCCC 
CCATCCCCGTGACCGTGTACTACGGGGTGCCCGTGTGGAAGGTGCTGGCCGAGGCCATGAGCCAGGTGATCCC 
CATCCACTACTGCGCCCCCGCCAAGCTGACCCCCCTGTGCGTGACCCTG 

FIG.18B 
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HIV-TC 

MGMQVQIQSLFLLLLWVPGSRGYWQATWI PEWKAI FQSSMTKKVYLAWVPAHKNAACPKVSFEPI KHPVHAGP 

IANLTFGWCFKLNKMIGGIGGFIKFRDYVDRFYKAAARILQQLLFINTTLFCASDAKNQMVHQAISPRGAKLV 

GKLNWAGAAAIYETYGDTWKAAQVPLRPMTYKGAAAVTVLDVGDAYNAAARYLKDQQLLNTLNFPISPINMTN 

NPPIPVNAPYNTPVFAIKAAAVPLQLPPLKAAIPYNPQSQGVVKALLQLTVWGIGAAILKEPVHGVNAAAFPI 

SP I ETVKVWKEATTTLFKAAAVTI KI GGQLKKI YQEPFKNLKAAAVLAEAMSQVNL VGPTP VN IGAAAEVN I V 

TDSQYKAMIPIHYCAPAK^VIYQYMDDLYKAMQMAVFIHNFKNMTYQIYQEPFKPYNEWTLELKAKIQNF 

RVYYRKAFPVRPQVPLGAMIWGCSGKLIKVMIVWQVDRNMKMCWWAGIKAKFVMWTLKAAAKLTPLCVT 

LNAAMASDFNLPPVKSLLNATDIAVNVTVYYGVPVWKKAAAAIIRILQQLKRAMASDFNLNAAAYPLASLRSL 
F 

ATGGGGATGCAGGTGCAGATCCAGAGCCTGTTTCTGCTCCTCCTGTGGGTGCCCGGATCTAGAGGATACTGGC 

MGCTACTTGGATTCCAGMTGGAMGCTATCTTTCAATCCTCAATGACGAAGAAGGTATACCTGGCATGGGT 

CCCAGCACACAAGAACGCCGCTTGCCCAAAGGTGTCCTTTGAACCCATTAAACACCCAGTGCACGCAGGGCCA 

ATAGCGMTTTGACATTCGGGTGGTGCTTCAMCTAMCAAMTGATCGGCGGCATTGGAGGCTTTATCAAGT 

TTAGAGATTACGTGGACCGATTCTATAAAGCCGCTGCCCGTATACTCCAGCAGCTACTATTCATCAACACCAC 

TCTCTTCTGCGCTTCAGACGCTAAGAACCAAATGGTACACCAAGCCATAAGCCCTAGAGGAGCCAAGCTCGTA 

GGGAMTTAAATTGGGCGGGTGCAGCAGCAATCTACGAGACTTACGGCGATACCTGGAAAGCAGCCCAGGTTC 

CGTTACGCCCAATGACCTATAAAGGCGCAGCAGCAGTAACAGTTCTAGATGTAGGAGACGCTTACAACGCTGC 

CGCMGATACCTAAMGATCAGCAGTTACTCAACACACTAAATTTCCCAATTAGCCCGATAAACATGACAAAT 

AACCCACCAATTCCCGTCAATGCTCCCTACAACACTCCAGTATTCGCAATCAAAGCCGCTGCTGTCCCCCTGC 

AGCTCCCTCCTCTGAAAGCTGCGATACCTTACAACCCACAGAGCCAAGGTGTTGTCAAAGCACTGCTTCAGCT 

MCAGTTTGGGGMTTGGTGCTGCMTTCTAAAAGAGCCAGTTCATGGGGTTAACGCCGCCGCCTTCCCAATC 

AGTCCTATTGAGACTGTGAAAGTATGGAMGMGCCACMCCACACTTTTTMGGCAGCCGCAGTTACAATTA 

AMTAGGGGGCCMCTTMGAAMTATACCAGGMCCTTTCAAGAATCtCAAAGCCGCTGCAGTGCTCGCCGA 

GGCTATGTCACAGGTGAATTTGGTCGGACCAACACCCGTAAACATCGGAGCCGCAGCCGAAGTGAACATAGTC 

ACCGACTCACAGTACAAAGCCGCTGCAATACCCATACATTATTGTGCTCCCGCAAAGGCCGTGATCTATCAAT 

ATATGGACGACCTGTATMGGCCGCCGCGCAGATGGCAGTCTTTATCCACAACTTTAAAAACGCAGCTACTTA 

TCAGATCTACCAGGMCCATTCAMCCGTACMTGAGTGGACCTTGGAACTAAAGGCCAAAATTCAGAACTTC 

AGGGTATATTATAGAAMGCATTTCCAGTGAGGCCCCAGGTGCCTCTGGGTGCCGCAGCAATATGGGGATGTT 

CTGGAAAACTGATCAAGGTGATGATTGTATGGCAAGTGGACAGAAATGCAGCTAAGGCAGCCTGTTGGTGGGC 

AGGTATAAAAGCAAAGTTCGTGGCAGCATGGACGCTTAAAGCAGCCGCAAAACTCACTCCTCTCTGCGTGACA 

CTTAATGCAGCCATGGCCTCTGATTTCAACCTTCCCCCTGTAAAATCCCTGCTTAATGCGACAGATATCGCAG 

TCAACGTAACAGTATATTATGGCGTGCCAGTCTGGAAAAAAGCCGCCGCGGCCATAATTCGGATACTGCAGCA 

GCTGAAAAGAGCTATGGCGAGTGACTTCAACCTGAATGCGGCCGCCTACCCCTTGGCATCGTTAAGGTCACTA 
TTTTGA 



FIG.18C 
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HCV.l 

MGMQVQIQSLFLLLLWVPGSRGLLFNILGGWVDLMGYIPLVYLVAYQATVILAGYGAGVRLIVFPDLGVHMWNFISGI 
YLLPRRGPRLYLVTRHADVVLVGGVLAALLFLLLADAFLLLADARVWMNRLIAFACTCGSSDLYLSAFSLHSYGVAGA 
LVAFKLPGCSFS I FKTSERSQPRL I FCHSKKKFWAKHMWNFI PFYGKAIRMYVGGVEHRQLFTFSPRRRLGVRATRKV 
GIYLLPNRAKFVAAWTLKAAA* 

GAATTCGCCGCCACCATGCAGGTGCAGATCCAGAGCCTGTTTCTGCTCCTCCTGTGGGTGCCCGGATCCAGAGGACTG 
CTGTTCAACATCCTGGGGGGGTGGGTGGATCTGATGGGGTACATCCCCCTGGTGTACCTGGTGGCCTACCAGGCCACC 
GTGATCCTGGCCGGGTACGGGGCCGGGGTGAGGCTGATCGTGTTCCCCGATCTGGGGGTGCACATGTGGAACTTCATC 
AGCGGGATCTACCTGCTGCCCAGGAGAGGACCTAGACTGTACCTGGTGACTAGACACGCTGATGTGGTGCTGGTGGGA 
GGAGTGCTGGCTGCTCTGCTGTTTCTGCTGCTGGCTGATGCTTTCCTGCTGCTGGCTGATGCTAGAGTGTGGATGAAC 
AGACTGATCGCTTTCGCTTGTACATGTGGAAGCTCCGATCTGTATCTGAGCGCTTTCAGCCTGCACAGCTACGGAGTG 
GCTGGAGCTCTGGTGGCTTTTMGCTGCCTGGATGTAGCTTTAGCATCTTTAAGACCAGCGAAAGAAGCCAGCCTAGA 
CTGATCTTTTGTCACAGCMGMGMGTTTTGGGCTM 

AGAATGTATGTGGGAGGAGTGGAACACAGACAGCTGTTTACATTTAGCCCTAGAAGGAGACTGGGAGTGAGAGCTACA 
AGAAAGGTGGGAATCTATCTGCTGCCTAATAGATGAAAGCTTGGG* 

HCV.2 

MGMQVQIQSLFLLLLWVPGSRGDLMGYIPLVAKFVAAWTLKAAALLFLLLADALIFCHSKKKQLFTFSPRRYLVTRHA 
DVYLLPRRGPRLCTCGSSDLYHMWNFISGIRJAKHMWNFAKFVAAWTLKAAAILAGYGAGVYLVAYQATVGVAGALVA 
FKIPFYGKAIRMYVGGVEHRVLVGGVLMFLLU\DARVLPGCSFSIFAKFVAAWTLKAAAKTSERSQPRRLGVRATRK 
RLIVFPDLGVWMNRLIAFALSAFSLHSYLLFNILGGWVVGIYLLPNR* 

GMTTCGCCGCCACCATGGGAATGCAGGTGCAGATCCAGAGCCTGTTTCTGCTCCTCCTGTGGGTGCCCGGATCCAGA 

GGAGATCTGATGGGATATATCCCTCTGGTGGCTMGTTTGTGGCTGCTTGGACACTGAAGGCTGCTGCTCTGCTGTTT 

CTGCTGCTGGCTGATGCTCTGATCTTCTGTCACAGCMGMGMGCAGCTGTTTACATTTAGCCCAAGAAGATATCTG 

GTGACAAGACACGCTGATGTGTATCTGCTGCCTAGACGCGGACCTAGACTGTGTACATGTGGAAGCTCCGATCTGTAT 

CACATGTGGMCTTTATCAGCGGMTC7TTTGGGCTMGCACATGTGGAATTTCATCCTGGCTGGATATGGAGCTGGA 

GTGTATCTGGTGGCTTATCAGGCTACAGTGGGAGTGGCTGGAGCTCTGGTGGCTTTCAAGATCCCATTCTATGGAAAG 

GCTATCAGAATGTATGTGGGAGGAGTGGAACACAGAGTGCTGGTGGGAGGAGTGCTGGCTGCTTTCCTGCTGCTGGCT 

GATGCTAGAGTGCTGCCAGGATGTAGCTTTAGCATCTTCAAGACTTCCGAACGCTCCCAGCCTAGAAGACTGGGAGTG 

AGAGCTACMGGMGAGACTGATCGTGTTTCCAGATCTGGGAGTGTGGATGAATAGACTGATCGCTTTCGCTCTGAGC 

GCTTTCAGCCTGCACAGCTATCTGCTG7TCAACATCCTGGGAGGATGGGTGGTGGGAATCTATCTGCTGCCAAACAGA 
TGAAAGCTT 

HCV.3sl 

MGMQVQIQSLFLLLLWVPGSRGYLVAYQATVAKFVAAWTLKAAALLFLLLADALIFCHSKKKYLVTRHADVLGFGAYM 
SKCTCGSSDLYHMWNFISGIFWAKHMWNF* 

GAATTCGCCGCCACCATGGGAATGCAGGTGCAGATCCAAAGCCTGTTTCTGCTCCTCCTGTGGGTGCCCGGATCCAGA 
GGATACCTCGTCGCCTACCAGGCCACTGTGGCTAAATTCGTGGCAGCCTGGACACTGAAAGCTGCAGCTCTGCTCTTC 
CTGCTCCTGGCCGATGCACTCATCTTCTGCCATTCCAAGAAAAAGTATCTGGTCACCAGACATGCTGACGTGCTGGGG 
TTTGGCGCCTACATGAGCMGTGCACCTGTGGCAGCTCCGACCTGTATCACATGTGGMCTTTATTTCTGGAATCTTT 
TGGGCCAAGCACATGTGGAATTTCTGAAAGCTT 
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HCV.3s2 

MGMQVQIQSLFLLLLWVPGSRGVLVGGVUW\KFVMWTLI<^FLLLADARVLSAFSLHSYILAGYGAGVWM 
NRLIAFAIPFYGKAIVAGALVAFKVGIYLLPNR* 

GMTTCGCCGCCACCATGGGMTGCAGGTGCAGATCCAMGCCTGTTTCTGCTCCTCCTGTGGGTGCCCGGAT 
CCAGAGGAGTCCTGGTGGGCGGCGTCCTGGCCGCTGCTAAGTTTGTCGCTGCTTGGACACTGAAGGCAGCCGC 
TTTCCTGCTCCTGGCAGACGCCAGGGTGCTGTCTGCCTTCAGCCTCCACTCCTACATCCTCGCAGGGTATGGC 
GCAGGCGTGTGGATGMTCGGCTGATCGCCTTTGCCATTCCATTCTATGGGAAAGCCATTGTGGCTGGCGCCC 
TGGTGGCATTCAAGGTCGGGATCTACCTCCTGCCTAACCGCTGAAAGCTT 

HCV.3s2(-3) 

MGMQVQIQSLFLLLLWVPGSRGVLVGGVUW^KFVMWTLKAMFLLUDARVLSAFSLHSYIUGYGAGVWM 
NRLIAFA* 

GAATTCGCCGCCACCATGGGAATGCAGGTGCAGATCCAAAGCCTGTTTCTGCTCCTCCTGTGGGTGCCCGGAT 
CCAGAGGAGTCCTGGTGGGCGGCGTCCTGGCCGCTGCTAAGTTTGTCGCTGCTTGGACACTGAAGGCAGCCGC 
TTTCCTGCTCCTGGCAGACGCCAGGGTGCTGTCTGCCTTCAGCCTCCACTCCTACATCCTCGCAGGGTATGGC 
GCAGGCGTGTGGATGAATCGGCTGATCGCCTTTGCCTGAGGATCC 

HCV.3S3 

MGMQ VQ I QS LFLLLLWVPGSRGDLMGY I P LVAKFVAAWTLKAAARLGVRATRKL LFN I LGGWVRMY VGGVEHR 
RLIVFPDLGVGVAGALVAFKLPGCSFSIFKTSERSQPRQLFTFSPRRYLLPRRGPRL 

GAATTCGCCGCCACCATGGGAATGCAGGTGCAGATCCAAAGCCTGTTTCTGCTCCTCCTGTGGGTGCCCGGAT 
CCAGAGGAGACCTGATGGGCTACATCCCTCTCGTGGCCAAGTTTGTGGCAGCTTGGACCCTGAAGGCCGCTGC 
CAGACTGGGAGTGCGCGCTACACGGAAACTCCTGTTTAACATCCTGGGAGGGTGGGTGCGGATGTACGTCGGA 
GGCGTCGAGCACAGAAGGCTCATTGTCTTTCCAGATCTCGGCGTGGGCGTCGCAGGCGCACTCGTGGCCTTCA 
MCTGCCAGGGTGCAGCTTCAGCATTTTCAAGACCTCCGAACGCTCCCAACCCAGACAGCTGTTCACTTTCTC 
TCCTCGGAGGTATCTGCTGCCCAGACGCGGACCCAGGCTGTGAAAGCTT 

HCV.PC3 

MGMQVQIQSLFLLLLWVPGSRGLLFNILGGWVKAKFVAAWTLKAAALADGGCSGGAYRLIVFPDLGVKFWAKH 
MWNFIGVAGALVAFKKQLFTFSPRR* 

GMTTCGCCGCCACCATGGGAATGCAGGTGCAGATCCAAAGCCTGTTTCTGCTCCTCCTGTGGGTGCCCGGAT 
CCAGAGGACTGCTCTTCAACATCCTGGGCGGATGGGTGAAGGCCAAGTTCGTGGCTGCCTGGACCCTGAAGGC 
TGCCGCTCTGGCCGACGGGGGATGCAGCGGCGGAGCTTACAGGCTCATTGTCTTTCCCGATCTCGGAGTCAAA 
TTTTGGGCAMGCACATGTGGMnTCATCGGGGTGGCCGGAGCCCTGGTCGCTTTTAAAMGCAGCTCTTCA 
CCTTCTCCCCAAGACGGTGAGGTACC 
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HCV.PC4 

MGMQVQIQSLFLLLLWVPGSRGRLGVRATRKKAKFVAAWTLKAAAKTSERSQPRNLPGCSFSIFNDLM6YIPL 
VKYLLPRRGPRLNTLCGFADLMGYRMYVGGVEHR* 

GAATTCGCCGCCACCATGGGAATGCAGGTGCAGATCCAAAGCCTGTTTCTGCTCCTCCTGTGGGTGCCCGGAT 
CCAGAGGAAGGCTGGGCGTGAGAGCCACCCGGAAGAAGGCCAAGTTCGTGGCTGCCTGGACCCTGAAGGCTGC 
CGCTAAMCMGCGAGCGCTCCCAGCCCAGGMCCTGCCTGGATGCTCTTTCAGCATCTTTAATGACCTCATG 
GGGTACATTCCACTGGTGAAGTATCTGCTCCCCAGACGGGGCCCTCGCCTGAACACTCTCTGTGGATTTGCTG 
ATCTGATGGGGTACAGGATGTATGTCGGCGGAGTCGAACACAGATGAGGTACC 

HCV.243K1P) 

MGMQVQIQSLFLLLLWVPGSRGVLVGGVLAAAFLLLADARVLSAFSLHSYILAGYGAGVWMNRLIAFAGAAAR 

LGVRATRKKAAAKTSERSQPRNLPGCSFSIFNDLMGYIPLVKYLLPRRGPRLNTLCGFADLMGYRMYVGGVEH 

RKLLFNILGGWVKAAALADGGCSGGAYRLIVFPDLGVKFWAKHMWNFIGVAGALVAFKKQLFTFSPRRNGYLV 

AYQATVAAALLFLLLADALIFCHSKKKYLVTRHADVLGFGAYMSKCTCGSSDLYHMWNFISGIFWAKHMWNFK 
AAAAKFVAAWTLKAAA 

GMTTCGCCGCCACCATGGGAATGCAGGTGCAGATCCAAAGCCTGTTTCTGCTCCTCCTGTGGGTGCCCGGCT 
CCAGAGGAGTCCTGGTGGGCGGCGTCCTGGCAGCCGCTTTCCTGCTCCTGGCAGACGCCAGGGTGCTGTCTGC 
CTTCAGCCTCCACTCCTACATCCTCGCAGGGTATGGCGCAGGCGTGtGGATGAATCGGCTGATCGCCTTTGCC 
GGCGCTGCCGCAAGGCTGGGCGTGAGAGCCACCCGGAAGAAGGCTGCCGCTAAAACAAGCGAGCGCTCCCAGC 
CCAGGMCCTGCCTGGATGCTCTTTCAGCATCTTTAATGACCTCATGGGGTACATTCCACTGGTGAAGTATCT 
GCTCCCCAGACGGGGCCCTCGCCTGAACACTCTCTGTGGATTTGCTGATCTGATGGGGTACAGGATGTATGTC 
GGCGGAGTCGAACACAGAAAACTGCTCTTCAACATCCTGGGCGGATGGGTGAAGGCTGCCGCTCTGGCCGACG 
GGGGATGCAGCGGCGGAGCTTACAGGCTCATTGTCTTTCCCGATCTCGGAGTCAMTTTTGGGCAAAGCACAT 
GTGGMTTTCATCGGGGTGGCCGGAGCCCTGGTCGCTTTTAAAAAGCAGCTCTTCACCTTCTCCCCAAGACGG 
AACGGATACCTCGTCGCCTACCAGGCCACTGTGGCTGCAGCTCTGCTCTTCCTGCTCCTGGCCGATGCACTCA 
TCTTCTGCCATTCCAAGAAAAAGTATCTGGTCACCAGACATGCTGACGTGCTGGGGTTTGGCGCCTACATGAG 
CMGTGCACCTGTGGCAGCTCCGACCTGTATCACATGTGGMCTTTATTTCTGGMTCTTTTGGGCCAAGCAC 
ATGTGGMTTTTAAGGCCGCAGCAGCTAAATTCGTGGCAGCCTGGACACTGAAAGCAGCTGCATGAGGATCC 
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HCV.4312QP) 

MGMQVQIQSLFLLLLWVPGSRGRLGVRATRKKAAAKTSERSQPRNLPGCSFSIFNDLMGYIPLVKYLLPRRGPRLNTLC 
GFADLMGYRMYVGGVEHRKLLFNILGGWVKAAALADGGCSGGAYRLIVFPDLGVKFWAKHMWNFIGVAGALVAFKKQLF 
TFSPRRNGYLVAYQATVAAALLFLLLADALIFCHSKKKYLVTRHADVLGFGAYMSKCTCGSSDLYHMWNFISGIFWAKH 
MWNFKKAMVLVGGVUW\FLLUDARVLSAFSLHSYIUGYGAGVWMNRLIAFANAAAKFVMWTLKA^ 

GAATTCGCCGCCACCATGGGAATGCAGGTGCAGATCCAAAGCCTGTTTCTGCTCCTCCTGTGGGTGCCCGGCTCCAGAG 

GAAGGCTGGGCGTGAGAGCCACCCGGAAGAAGGCTGCCGCTAAAACAAGCGAGCGCTCCCAGCCCAGGAACCTGCCTGG 

ATGCTCTTTCAGCATCTTTAATGACCTCATGGGGTACATTCCACTGGTGAAGTATCTGCTCCCCAGACGGGGCCCTCGC 

CTGAACACTCTCTGTGGATTTGCTGATCTGATGGGGTACAGGATGTATGTCGGCGGAGTCGAACACAGAAAACTGCTCT 

TCAACATCCTGGGCGGATGGGTGAAGGCTGCCGCTCTGGCCGACGGGGGATGCAGCGGCGGAGCTTACAGGCTCATTGT 
CTTTCCCGATCTCGGAGTCAMTTTTGGGCAMGCACATGTGG 

AAAAAGCAGCTCTTCACCTTCTCCCCAAGACGGAACGGATACCTCGTCGCCTACCAGGCCACTGTGGCTGCAGCTCTGC 
TCTTCCTGCTCCTGGCCGATGCACTCATCTTCTGCCATTCCAAGAAAAAGTATCTGGTCACCAGACATGCTGACGTGCT 
GGGGTTTGGCGCCTACATGAGCMGTGCACCTGTGGCAGCTCCGACCTGTATCACATGTGGMCTTTATTTCTGGAATC 
TTTTGGGCCMGCACATGTGGMTTTTAAGAAAGCCGCTGCAGTCCTGGTGGGCGGCGTCCTGGCAGCCGCTTTCCTGC 
TCCTGGCAGACGCCAGGGTGCTGTCTGCCTTCAGCCTCCACTCCTACATCCTCGCAGGGTATGGCGCAGGCGTGTGGAT 
GAATCGGCTGATCGCCTTTGCCAATGCTGCAGCTAAATTCGTGGCAGCCTGGACACTGAAAGCAGCTGCATGAGGATCC 

AOSI.K 

MGMQVQIQSLFLLLLWVPGSRGHTLWKAGILYKAKFVAAWTLKAAAFLPSDFFPSVKFLLSLGIHLYMDDVVLGVGLSR 
YVARLFLLTRILTISTLPETTVVRRQAFTFSPTYKWLSLLVPFV 

ATGGGAATGCAGGTGCAGATCCAGAGCCTGTTTCTGCTCCTCCTGTGGGTGCCCGGGTCCAGAGGACACACCCTGTGGA 
AGGCCGGAATCCTGTATAAGGCCAAGTTCGTGGCTGCCTGGACCCTGAAGGCTGCCGCTTTCCTGCCTAGCGATTTCTT 
TCCTAGCGTGAAGTTCCTGCTGTCCCTGGGAATCCACCTGTATATGGATGACGTGGTGCTGGGAGTGGGACTGTCCAGG 
TACGTGGCTAGGCTGTTCCTGCTGACCAGAATCCTGACCATCTCCACCCTGCCAGAGACCACCGTGGTGAGGAGGCAGG 
CCTTCACCTTTAGCCCTACCTATAAGTGGCTGAGCCTGCTGGTGCCCTTTGTGTGA 

HBV.l 

MGMQVQIQSLFLLLLWVPGSRGHTLWKAGILYKAKFVAAWTLKAAAFLPSDFFPSVFLLSLGIHLYMDDVVLGVGLSRY 
VARLFLLTRI LTISTLPETTVVRRQAFTFSPTYKWLSLLVPFVI P I PSSWAFTPARVTGGVFKVGNFTGLYLPSDFFPS 
VTLWKAGILYKNVSIPWTHKLVVDFSQFSRSAICSVVRRALMPLYACI 

ATGGGAATGCAGGTGCAGATCCAGAGCCTGTTTCTGCTCCTCCTGTGGGTGCCCGGGTCCAGAGGACACACCCTGTGGA 
AGGCCGGMTCCTGTATMGGCCMGTTCGTGGCTGCCTGGACCCTGMGGCTGCCGCTTTCCTGCCTAGCGATTTCTT 
TCCTAGCGTGTTCCTGCTGTCCCTGGGAATCCACCTGTATATGGATGACGTGGTGCTGGGAGTGGGACTGTCCAGGTAC 
GTGGCTAGGCTGTTCCTGCTGACCAGAATCCTGACCATCTCCACCCTGCCAGAGACCACCGTGGTGAGGAGGCAGGCCT 
TCACCTTTAGCCCTACCTATAAGTGGCTGAGCCTGCTGGTGCCCTTTGTGATCCCTATCCCTAGCTCCTGGGCTTTCAC 
CCCAGCCAGGGTGACCGGAGGAGTGTTTMGGTGGGAMCTTCACCGGCCTGTATCTGCCCAGCGATTTCTTTCCTAGC 
GTGACCCTGTGGAAGGCCGGGATCCTGTACAAGAATGTGTCCATCCCTTGGACCCACAAGCTGGTGGTGGACTTTTCCC 
AGTTCAGCAGATCCGCTATCTGCTCCGTGGTGAGGAGAGCTCTGATGCCACTGTATGCCTGTATCTGA 
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HBV.2 

MGMQVQIQSLFLLLLWVPGSRGHTLWKAGILYKAKFVAAWTLKAAAFLPSDFFPSVNFLLSLGIHLYMDDVVLGVGLSR 
YVARLFLLTRI LTISTLPETTVVRRQAFTFSPTYKGAAAWLSLLVPFVNI P I PSSWAFKTPARVTGGVFKVGNFTGLYN 
LPSDFFPSVKTLWKAGILYKNVSIPWTHKGAALVVDFSQFSRNSAICSVVRRALMPLYACI 

ATGGGAATGCAGGTGCAGATCCAGAGCCTGTTTCTGCTCCTCCTGTGGGTGCCCGGGTCCAGAGGACACACCCTGTGGA 
AGGCCGGAATCCTGTATAAGGCCAAGTTCGTGGCTGCCTGGACCCTGAAGGCTGCCGCTTTCCTGCCTAGCGATTTCTT 
TCCTAGCGTGAACTTCCTGCTGTCCCTGGGAATCCACCTGTATATGGATGACGTGGTGCTGGGAGTGGGACTGTCCAGG 
TACGTGGCTAGGCTGTTCCTGCTGACCAGAATCCTGACCATCTCCACCCTGCCAGAGACCACCGTGGTGAGGAGGCAGG 
CCTTCACCTTTAGCCCTACCTATAAGGGAGCCGCTGCCTGGCTGAGCCTGCTGGTGCCCTTTGTGAATATCCCTATCCC 
TAGCTCCTGGGCTrTCAAGACCCCAGCCAGGGTGACCGGAGGAGTGTTTAAGGTGGGAAACTTCACCGGCCTGTATAAC 
CTGCCCAGCGATTTCTTTCCTAGCGTGAAGACCCTGTGGAAGGCCGGAATCCTGTACAAGAATGTGTCCATCCCTTGGA 
CCCACAAGGGAGCCGCTCTGGTGGTGGACTrrTCCCAGTTCAGCAGAAATTCCGCTATCTGCTCCGTGGTGAGGAGAGC 
TCTGATGCCACTGTATGCCTGTATCTGA 

PfCTL.l 

MQVQIQSLFLLLLWVPGSRGILSVSSFLFVNAMQTNFKSLLRNLPSENERGYKAMLLACAGLAYKKAAMKFVMWT 
LKAAAKAFMKAVCVEVNAMSFLFVEALFNATPYAGEPAPFKAAAKYKLATSVLKTVGVSENIFLKNAMYFILVNLl 

agllgvvstv 

atgggaatgcaggtgcagatccagagcctgtttctgctcctcctgtgggtgcccggatccagaggaatcctgagcgtgt 
cctctttcctgtttgtcmcgccgctgcacagaccmtttcmgagcctcctgaggmcctcccctccgagaacgaaag 
aggctacaaagccgctgcactgctcgcctgcgctggactggcctataagaaagccgctgcagccaagttcgtggccgct 
tggacactgmggccgctgcaamgcctttatgmggctgtctgtgtggaggtcmtgccgctgcatctttcctgtttg 
tggaggccctctttaacgctactccttacgcaggggaaccagcccccttcaaggccgctgcaaaatataagctggcaac 
cagcgtgctgmggctggcgtgtccgagmtatttttctgaaaaacgccgctgcatacttcatcctggtgaatctgctc 
attaaggccggactcctgggggtggtctctacagtgtga 

PfCTL.2 

MQVQIQSLFLLLLWVPGSRGFVEALFQEYNAAAKYLVIVFLINALACAGLAYKKFYFILVNLLKAALFFIIFNKNAAAK 
FVMWTLKAAAKFILVNLLIFHNFQDEENIGIYKLPYGRTNLKAAAVLLGGVGLVLNFLIFFDLFLVKAVLAGLLGVV 

ATGGGAATGCAGGTGCAGATCCAGAGCCTGTTTCTGCTCCTCCTGTGGGTGCCCGGATCCAGAGGATTCGTGGAGGCCC 

TGTTTCAGGAATACAACGCCGCTGCAAAGTATCTCGTCATCGTGTTCCTGATCAATGCTCTGGCATGCGCCGGCCTCGC 

TTACAAAMGTTTTACTTCATTCTGGTCMCCTGCTCMGGCCGCTCTGTTCTTTATCATTTTCMTA^ 

GCTMGTTTGTGGCCGCATGGACCCTGMGGCCGCTGCAAMTTCATCCTCGTGMTCTGCTCATTTTTCACAACTTCC 

MGACGAGGAAAATATCGGAATTTATAAGCTGCCCTACGGGAGGACAAACCTGAAAGCCGCTGCAGTCCTGCTCGGCGG 

AGTGGGGCTGGTGCTCMTTTTCTGATCTTCTTTGATCTGTTCCTGGTGAAGGCCGTCCTGGCCGGCCTGCTCGGAGTC 
GTGTGA 
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PfCTL.3 

MQVQIQSLFLLLLWVPGSRGVFLIFFDLFLNAMPSDGKCNLYKAAAVTCGNGIQVRKLFHIFDGDNEIKAHVLSHNSY 

EKNYYGKQENWYSLKKILSVFFLANAMKFIKSLFHIFKAM 

AAGLIMVLSFL 

ATGGGMTGCAGGTGCAGATCCAGAGCCTGTTTCTGCTCCTCCTGTGGGTGCCCGGATCCAGAGGAGTGTTCCTGATCT 

TCTTTGACCTGTTCCTGAACGCCGCTGCACCCAGCGATGGCAAGTGCAATCTCTACAAGGCCGCTGCAGTGACCTGTGG 

AMCGGGATTCAGGTCAGGAMCTCTTTCACATCTTCGACGGCGATAACGAGATCAAGGCCCATGTGCTGTCCCACAAT 

TCTTATGAAAAAMCTACTATGGAMGCMGAGMTTGGTACAGCCTGMGAAMTTCTGTCCGTGTTCTTTCTCGCCA 

ACGCCGCTGCAMGTTTATCMGTCTCTGTTCCATATTTTCMGGCCGCTGCACTCTACATCAGCTTCTATTTTATTM 

AGCCAMTTTGTGGCCGCTTGGACACTGAAGGCCGCTGCAAAAGCCGCTGCATACTATATCCCTCACCAGAGCTCCCTG 
AAGGCCGCTGCAGGGCTGATCATGGTGCTCTCTTTCCTGTGA 

PfCTL/HTKN) 

MQVQIQSLFLLLLWVPGSRGSSVFNVVNSSIGLIMVLSFLGPGPGLYISFYFILVNLLIFHINGKIIKNSEGPGPGPDS 
IQDSLKESRKLSGPGPGVLAGLLGVVSTVLLGGVGLVLGPGPGLPSENERGYYIPHQSSLGPGPGQTNFKSLLRNLGVS 
ENIFLKGPGPGFQDEENIGIYGPGPGKYLVIVFLIFFDLFLVGPGPGKFIKSLFHIFDGDNEIGPGPGKSKYKLATSVL 
AGLLGPGPGLPYGKTNLGPGPGRHNWVNHAVPLAMKLIGPGPGMRKLAILSVSSFLFVEALFQEYGPGPGVTCGNGIQV 
RGPGPGMNYYGKQENWYSLKKGPGPGPSDGKCNLYADSAWENVKNVIGPFMKAVCVEVGPGPGKILSVFFLALFFIIFN 
KGPGPGHVLSHNSYEKGPGPGKYKIAGGIAGGLALLACAGLAYKFVVPGAATPYAGEPAPF 

ATGGGAATGCAGGTGCAGATCCAGAGCCTGTTTCTGCTCCTCCTGTGGGTGCCCGGATCCAGAGGAAGTAGTGTGTTCA 
ATGTTGTGAACTCATCAATTGGTCTGATCATGGTGCTGAGCTTTCTCG 

GGCCAGGGCCAGGATTATATATTTCTTTCTACTTCATCCTTGTCMCCTGTTMTATTCCACATTMCGGCAAMT 

AAAGAACAGTGAAGGCCCTGGGCCTGGGCCTGACTCGATCCAGGATTCTCTAAAAGAATCGAGGAAGCTCTCCGGACCA 

GGCCCTGGTGTACTCGCCGGGTTGCTGGGAGTAGTTAGCACAGTGCTGTTAGGAGGCGTCGGCCTCGTCTTAGGACCTG 

GACCAGGTCTGCCGTCCGAAAACGAAAGAGGATACTACATACCTCACCAGAGCAGCCTCGGCCCAGGCCCCGGACAAAC 

CMTTTCAMTCCCTCTTGCGAMTCTAGGAGTGAGCGAGMCATATTTCTTAMGGACCCGGTCCCGGCTTTCAGGAC 

GAGGAGMTATAGGTATTTACGGTCCAGGACCTGGAAMTACCTAGTGATCGTATTCCTMTTTTTTTTGACCT 

TGGTGGGCCCAGGTCCCGGAMGTTCATTAMTCACTCTTCCACA7TTTTGACGGAGATAACGAGATAGGACCCGGTCC 

CGGGAAATCAAAGTACAAACTAGCCACTTCAGTGCTGGCCGGCCTTCTAGGGCCGGGCCCAGGGCTCCCCTATGGAAAG 

ACAAATCTTGGCCCCGGTCCAGGACGGCACAACTGGGTGAATCATGCGGTTCCATTGGCCATGAAACTAATCGGGCCCG 

GTCCAGGCATGCGCAMCTTGCMTTCTMGCGTMGTTCATTTCTGTTCGTAGAGGCACTGTTTCAAGAATATGGCCC 

AGGACCTGGCGTCACATGTGGGAATGGGATCCAGGTGAGAGGACCGGGACCTGGTATGAACTATTACGGTAAACAGGAA 

AATTGGTACTCCCTGAAAAAGGGTCCAGGCCCCGGCCCCTCAGATGGTAAGTGCAACCTGTATGCTGACTCAGCATGGG 

AGMCGTAAAAMTGTMTAGGCCCATTCATGMGGCAGTTTGTGTCGMGTCGGACCAGGCCCAGGAAAMTACTTTC 

TGTCTTCTTCCTAGCTCTCTTCTTCATCATCTTCAACAAGGGACCAGGGCCAGGTCACGTGTTATCCCATAACTCTTAT 

GAAAAAGGGCCAGGACCTGGGAAATACAAAATCGCAGGAGGGATCGCCGGCGGGCTAGCGCTCCTTGCCTGCGCAGGCT 

TGGCTTACAAAnCGTTGTACCAGGAGCTGCMCACCCTATGCAGGAGMCCTGCCCCATTTTGAAGATCTGC 
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Pf33 

MGMQVQIQSLFLLLLWVPGSRGFMKAVCVEVNVTCGNGIQVRKGLIMVLSFLNAALFHIFDGDNEIKAALLACAGLAYK 

KSFLFVEALFNAAPSDGKCNLYKAAQTNFKSLLRNLPSENERGYKAAGVSENIFLKNAAAYFILVNLLIKAAAILSVSS 

FLFVNTPYAGEPAPFKAMKYKU\TSVLKMVFLIFFDLFLNYYIPHQSSLKMGLLGNVSTVGAVLLGGVGLVLNLAC 

AGU\YKKAKFIKSLFHIFKAAFYFILVNLLKAFLIFFDLFLVKALFFIIFNKNYYGKQENWYSLKFVEALFQEYNAAAK 

FVMVfTLKAMKILSVFFU\NAVLAGLLGNVNFQDEENIGIYKAMLYISFYFIKAFILVNLLIFHNMLPYGR 

AHVLSHNSYEKNAAAKYLVIVFLI 

GCCGCCACCATGGGMTGCAGGTGCAGATCCAGAGCCTGTTTCTGCTCCTCCTGTGGGTGCCCGGATCCAGAGGATTTA 
TGAAAGCTGTCTGTGTAGAGGTGAATGTAACATGCGGTAACGGAATTCAGGTGAGAAAGGGACTCATCATGGTACTCAG 
CT1TCTGMCGCAGCCCTGTTCCACATCTTTGACGGAGACAATGAAATCAAAGCCGCATTGCTCGCCTGTGCCGGACTA 
GCCTATAAAMGAGTTTCCTTTTCGTTGMGC^^ 

CAGCTCAGACTMTTTCAAMGCCTGTTMGAMTCTGCCCTCAGAGAATGAAAGGGGTrACAAAGCCGCCGGCGTGTC 
CCJAGMTATTTTCCTGMGMCGCCGCTGCTTA^ 

GTGTCCAGCTTTCTGTTTGTTMCACACCATATGCGGGCGAGCCGGCTCCTTTCMGGCTGCAGCAAAATACAAGCTTG 
CCACATCAGTATTGAMGCAGCTGTGTTTTTGATATTCTTTGATCI 1 1 1 1 1 1 AAACTACTACATACCTCATCAGTCTAG 
TCTTAMGCAGCCGGGCTACTGGGGMCGTCTCTACTGTGGGGGCCGTCTTACTTGGAGGAGTTGGCCTCGTGTTGAAC 
CTCGCGTGCGCAGGTCTGGCCTACAAAAMGCGAMTTCATCMGTCTCTGTTCCACATTTTTAMGCCGCATTCTAT^ 
TCATACTAGTGMCCTTCTCAMGCTTTCCTGATCTTCTTCGATCTATTCCTCGTAAMGCGCTATTCTTCATTATC^ 
TMCAAAAATTATTACGGCMGCMGAAMTTGGTACTCACTCMGTTTGTAGMGCTCTGTTCCAGGAATACAACGCC 
GCTGCTAMTTCGTTGCAGCTTGGACCCTGAAAGCAGCTGCAMGATCCTATCGGTCTTCTTTCTCGCTAATGCCGTAT 
TAGCAGGACTTCTAGGCMCGTGMCTTTCMGACGMGAGMTATAGGCATCTACAMGCCGCAGCACTGTACATTTC 
ATTCTACTTCATCMGGCCTTCATACTGGTCMCCTTCTGATATTTCATAATGCAGCACTGCCATATGGGAGAACCAAC 
TTGAAAGCGGCCCACGTGTTGAGCCACAACTCCTACGAGAAGAACGCCGCCGCGAAATATCTCGTCATTGTCTTCCTGA 
TTTGA 



TB.l 

MQVQIQSLFLLLLWVPGSRGRMSR\m"FTVKALVLLMLPVVNLMIGTAAAVVKALVLLMLPVGAGLMTAVYLVGAAAMA 
LLRLPVKRMFMNLGVNSLYFGGICVGRLPLVLPAVNAAAAKFVMWTLKAMKAMRLMIGTAMGFVVALIPLVNAM 
TYAAPLFVGAAAAMALLRLPLV 

ATGCAGGTGCAGATCCAGAGCCTGTTTCTGCTCCTCCTGTGGGTGCCCGGATCCAGAGGAAGGATGAGCAGAGTGACCA 
CATTCACTGTCAAGGCCCTGGTGCTCCTGATGCTCCCCGTCGTGAACCTGATGATCGGCACCGCTGCAGCCGTCGTGAA 
AGCTCTCGTCCTGCTCATGCTCCCTGTGGGAGCAGGGCTGATGACAGCCGTGTACCTGGTCGGCGCTGCAGCCATGGCC 
CTCCTGCGGCTGCCAGTGMGCGCATGTTTGCTGCAMTCTGGGAGTCMCTCCCTCTATTTCGGGGGCATTTGCGTGG 
GMGGCTGCCCCTCGTGCTGCCTGCTGTGMTGCAGCCGCTGCCAMTTTGTCGCCGCTTGGACTCTGAAGGCAGCCGC 
TAAGGCCGCTGCAAGACTGATGATCGGGACCGCCGCTGCCGGCTTCGTGGTCGCCCTGATTCCCCTGGTGAACGCCATG 
ACATACGCAGCTCCTCTGTTTGTGGGAGCCGCTGCAGCCATGGCTCTCCTGCGGCTGCCACTGGTGTGA 
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BCL A2 #90 

MQVQIQSLFLLLLWVPGSRGIMIGHLVGVNRLLQETELVNAKVAEIVHFLNAKVFGSLAFVNAYLSGANLNVG 
AAYLQLVFG I E VNAAAKFVAAWTLKAAAKAAAVVLGVVFGI NSMPPPGTRVNAAAATVG I M IGVNAKLCP VOL 
WV 

ATGCAGGTGCAGATCCAGAGCCTGTTTCTGCTCCTCCTGTGGGTGCCCGGGTCCAGAGGAATTATGATCGGCC 

ATCTGGTGGGCGTCAACAGACTGCTGCAGGAAACCGAGCTGGTGAATGCCAAGGTGGCCGAAATTGTGCACTT 

TCTCMCGCAMGGTGTTTGGTTCCCTGGCTTTTGTCAATGCCTATCTGAGCGGCGCTAACCTCAACGTCGGA 

GCCGCCTACCTCCAGCTGGTCTTCGGCATCGAGGTCAACGCTGCTGCAAAATTCGTGGCAGCTTGGACCCTCA 

AGGCTGCAGCAAAGGCTGCCGCCGTCGTGCTCGGAGTGGTGTTCGGGATCAACTCTATGCCACCTCCCGGGAC 

TAGGGTCAATGCTGCCGCCGCAACAGTGGGAATCATGATTGGGGTGAATGCCAAACTGTGCCCAGTGCAACTG 
TGGGTGTGA 

BCL A2 #88 

MQVQIQSLFLLLLWVPGSRGVVLGVVFGINAAAAKFVAAWTLKAAAKVAEIVHFLNAYLSGANLNVGAAYLQL 
VFGIEVNIMIGHLVGVNRLLQETELVNAKVFGSLAFVNAKLCPVQLWVNAAAATVGIMIGVNSMPPPGTRV 

ATGCAGGTGCAGATCCAGAGCCTGTTTCTGCTCCTCCTGTGGGTGCCCGGGTCCAGAGGAGTCGTGCTGGGAG 
TCGTCTTCGGCATTAATGCCGCCGCTGCAAAGTTCGTGGCTGCCTGGACCCTGAAGGCCGCAGCTAAAGTGGC 
AGAGATCGTGCACTTTCTGAACGCCTACCTGAGCGGAGCAAATCTGAACGTCGGCGCTGCCTATCTGCAGCTC 
GTGTTTGGAATTGAAGTGAACATCATGATTGGACATCTGGTGGGCGTGAACAGGCTGCTCCAGGAAACTGAGC 
TGGTCAACGCTAAAGTGTTCGGGTCTCTCGCCTTTGTGAACGCTAAGCTCTGCCCCGTCCAACTCTGGGTCAA 
TGCCGCAGCCGCTACAGTGGGGATCATGATCGGCGTGAACTCCATGCCTCCACCAGGGACCAGAGTGTGA 

BCL A2 #63 

MQVQIQSLFLLLLWVPGSRGKLCPVQLWVNAAAATVGIMIGVNIMIGHLVGVNRLLQETELVNAKVAEIVHFL 

NAKVFGSU\FVNAYLSGANLNVGMYLQLVFGIEVNAMKFVMWTLKAAAKAAAVVLGVVFGINSMPPPGTR 
V 

ATGCAGGTGCAGATCCAGAGCCTGTTTCTGCTCCTCCTGTGGGTGCCCGGGTCCAGAGGAAAGCTCTGCCCCG 

TGCAACTGTGGGTCAACGCCGCCGCCGCAACCGTCGGCATTATGATCGGGGTGAACATCATGATCGGACACCT 

GGTCGGCGTGAACAGGCTGCTGCAGGAGACAGAACTGGTCAATGCCAAGGTGGCTGAAATTGTCCATTTCCTG 

AATGCCAAAGTGTTCGGCTCTCTCGCTTTCGTGAACGCTTATCTGAGCGGAGCTAACCTCAACGTGGGGGCCG 

CATACCTCCAGCTCGTCTTTGGGATTGAGGTGAATGCCGCAGCTAAATTTGTCGCTGCCTGGACCCTGAAGGC 

AGCAGCCAAGGCTGCCGCAGTGGTGCTGGGAGTGGTGTTTGGAATCAATTCCATGCCTCCACCAGGCACTAGA 
GTGTGAGGATCC 
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Prostate 1 

LTFFWLDRSVKAMVLVHPQWVLTVKAMLLQERGVAYIKMLLLSIALSVNPLVCNGVLQGVKMIMYSAHD 

nVKAMFLTPKKLQCVNAMMNDQLMFLNAGLPSIPVHPVKAMLGTTCYVGAAILLWQPIPVNFLRPRSLQC 
VKAFLTLSVTWIGVNALLYSLVHNLGAATLMSAMTNL 

ATGCAGGTGCAGATCCAGAGCCTGTTTCTGCTCCTCCTGTGGGTGCCCGGGTCCAGAGGATTGACATTTTTTT 
GGCTGGATAGATCGGTTAAGGCTGCAGCeGTGCTTGTTCATCCCCAGTGGGTCTTGACCGTAAAGGCTGCCGC 
GCTGCTACAAGAAAGAGGGGTCGCATACATCAAAGCTGCTCTCCTCTTGAGTATTGCGCTAAGTGTAAACCCG 
CTAGTTTGTMTGGGGTGTTAGAAGGTGTGAAAGCGGCGATTATGTACAGTGCCCACGACACTACCGTAAAAG 
CAGCCGCTTTCCTGACCCCAAAAAMCTCCMTGCGTGMCGCMTGATGMTGATCAGCTGATGTTTTTAAA 
CGCTGGCTTACCTTCTATACCGGTTCATCCAGTCAAGGCCGCGGCATTGGGTACGACGTGTTATGTTGGAGCA 
GCGATACTTCTTTGGCAGCCCATACCAGTAM7TTTTTMGACCTAGATCCTTACMTGCGTCAMGCATTCC 
TTACACTCTCAGTAACTTGGATCGGAGTCAATGCTCTGCTATATAGCCTCGTACACAACTTGGGCGCGGCCAC 
ACTTATGAGTGCAATGACGAATTTAGCTAAGTTCGTGGCGGCCTGGACTCTAAAGGCCGCAGCA 

HIV- 1043 

MEKVYLAWVPAHKGIGGGPGPGQKQITKIQNFRVYYRGPGPGWEFVNTPPLVKLWYQGPGPGYRKILRQRKID 
RLIDGPGPGQHLLQLTVWGIKQLQGPGPGGEIYKRWIILGLNKIVRMYGPGPGQGQMVHQAISPRTLNGPGPG 
IKQFINMWQEVGKAMYGPGPGWAGIKQEFGIPYNPQGPGPGKTAVQMAVFIHNFKRGPGPGSPAIFQSSMTKI 
LEPGPGPGEVNIVTDSQYALGIIGPGPGHSNWRAMASDFNLPPGPGPGAETFYVDGAANRETKGPGPGGAVVI 
QDNSDIKVVPGPGPGFRKYTAFTIPSINNE 

ATGGAGAAGGTGTACCTGGCCTGGGTTCCAGCCCACAAAGGCATCGGGGGAGGGCCCGGACCTGGGCAGAAAC 

AGATCACCAAGATCCAGAACTTCCGGGTATACTACCGGGGACCTGGTCCAGGTTGGGAGTTTGTGAACACACC 

ACCCTTAGTAAAGCTCTGGTACCAGGGCCCCGGTCCCGGATACCGTAAAATCCTGAGGCAAAGAAAGATAGAT 

CGCCTCATTGATGGCCCGGGCCCAGGCCAGCACCTTCTGCAGCTTACAGTGTGGGGAATTAAACAGCTGCAGG 

GGCCGGGCCCCGGGGGGGAAATTTATAAAAGGTGGATCATTCTGGGTCTGAACAAGATCGTCCGCATGTATGG 

CCCTGGACCCGGACAGGGGCAGATGGTCCACCAAGCAATCAGCCCTCGAACCTTGAATGGACCGGGCCCAGGA 

ATCAAGCAATTCATTAACATGTGGCAAGAAGTTGGTAAGGCTATGTACGGTCCCGGCCCTGGATGGGCAGGGA 

TAAAACAGGAGTTTGGAATCCCTTACAATCCCCAGGGTCCTGGGCCAGGTAAAACGGCAGTGCAGATGGCCGT 

GTTCATTCATMTTTTAAGCGGGGCCCTGGACCTGGCAGCCCAGCTATATTTCAAAGTTCGATGACCAAAATC 

TTGGAGCCCGGCCCAGGGCCGGGCGAAGTGAACATTGTCACAGATTCTCAGTATGCCCTCGGCATCATAGGGC 

CCGGACCAGGGCATTCCAATTGGCGCGCCATGGCGTCTGACTTTAATCTACCTCCTGGGCCAGGCCCTGGCGC 

GGAAACTTTCTATGTGGACGGCGCTGCAAACAGGGAGACTAAGGGACCCGGACCCGGCGGCGCTGTAGTCATT 

CAGGACAACTCAGACATCAAGGTGGTTCCCGGTCCAGGCCCCGGGTTCAGAAAGTATACCGCCTTCACTATTC 
CGTCCATCAACAATGAGTGA 
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HIV- 1043 PADRE 

MEKV Y LAWVPAHKGI GGGPGPGQKQ I TK I QNFRVYYRGPGPGWE F VNTPP LVKLWYQGPGPGYRK I LRQRK I D 
RLIDGPGPGQHLLQLTVWGIKQLQGPGPGGEIYKRWIILGLNKIVRMYGPGPGQGQMVHQAISPRTLNGPGPG 
IKQFINMWQEVGKAMYGPGPGWAGIKQEFGIPYNPQGPGPGKTAVQMAVFIHNFKRGPGPGSPAIFQSSMTKI 
LEPGPGPGEVNIVTDSQYALGIIGPGPGHSNWRAMASDFNLPPGPGPGAETFYVDGAANRETKGPGPGGAVVI 
QDNSDIKVVPGPGPGFRKYTAFTIPSINNEGPGPGAKFVAAWTLKAAA 

ATGGAGAAGGTGTACCTGGCCTGGGTTCCAGCCCACAAAGGCATCGGGGGAGGGCCCGGACCTGGGCAGAAAC 

AGATCACCAAGATCCAGAACTTCCGGGTATACTACCGGGGACCTGGTCCAGGTTGGGAGTTTGTGAACACACC 

ACCCTTAGTAAAGCTCTGGTACCAGGGCCCCGGTCCCGGATACCGTAAAATCCTGAGGCAAAGAAAGATAGAT 

CGCCTCATTGATGGCCCGGGCCCAGGCCAGCACCTTCTGCAGCTTACAGTGTGGGGAATTAAACAGCTGCAGG 

GGCCGGGCCCCGGGGGGGAMTTTATAAAAGGTGGATCATTCTGGGTCTGAACAAGATCGTCCGCATGTATGG 

CCCTGGACCCGGACAGGGGCAGATGGTCCACCAAGCAATCAGCCCTCGAACCTTGAATGGACCGGGCCCAGGA 

ATCAAGCAATTCATTAACATGTGGCAAGAAGTTGGTAAGGCTATGTACGGTCCCGGCCCTGGATGGGCAGGGA 

TAAMCAGGAGTTTGGAATCCCTTACAATCCCCAGGGTCCTGGGCCAGGTAAAACGGCAGTGCAGATGGCCGT 

GTTCATTCATMTTTTMGCGGGGCCCTGGACCTGGCAGCCCAGCTATATTTCAMGTTCGATGACCAAAATC 

TTGGAGCCCGGCCCAGGGCCGGGCGAAGTGAACATTGTCACAGATTCTCAGTATGCCCTCGGCATCATAGGGC 

CCGGACCAGGGCATTCCAATTGGCGCGCCATGGCGTCTGACTTTAATCTACCTCCTGGGCCAGGCCCTGGCGC 

GGAAACTTTCTATGTGGACGGCGCTGCAAACAGGGAGACTAAGGGACCCGGACCCGGCGGCGCTGTAGTCATT 

CAGGACMCTCAGACATCMGGTGGTTCCCGGTCCAGGCCCCGGGTTCAGAAAGTATACCGCCTTCACTATTC 

CGTCCATCAACAATGAGGGCCCCGGCCCAGGTGCCAAGTTCGTGGCTGCCTGGACCCTGAAGGCTGCCGCTTG 
A 

HIV 75mer 

EKVYLAWVPAHKGIGGPGPGQGQMVHQAISPRTLNGPGPGSPAIFQSSMTKILEPGPGPGFRKYTAFTIPSIN 
NE 

GAGAAGGTGTACCTGGCCTGGGTGCCTGCCCACAAGGGAATCGGAGGACCTGGCCCTGGACAGGGACAGATGG 

TGCACCAGGCCATCAGCCCTAGGACCCTGAACGGACCTGGACCTGGAAGCCCTGCCATCTTCCAGAGCAGCAT 

GACCAAGATCCTGGAGCCCGGACCTGGACCTGGATTCAGGAAGTACACCGCCTTCACCATCCCCAGCATCAAC 
AACGAGTGA 
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PfHTL 

MQVQIQSLFLLLLWVPGSRGRHNWVNHAVPLAMKLIGPGPGKCNLYADSAWENVKNGPGPGKSKYKLATSVL 
AGLLGPGPGQTNFKSLLRNLGVSEGPGPGSSVFNVVNSSIGLIMGPGPGVKNVIGPFMKAVCVEGPGPGMNY 
YGKQENWYSLKKGPGPGGLAYKFVVPGAATPYGPGPGPDSIQDSLKESRKLNGPGPGLLI FHI NGKI IKNSE 
GPGPGAGLLGNVSTVLLGGVGPGPGKYKIAGGIAGGLALLGPGPGMRKLAILSVSSFLFV 

ATGGGAATGCAGGTGCAGATCCAGAGCCTGTTTCTGCTCCTCCTGTGGGTGCCCGGATCCAGAGGAAGGCAC 
AACTGGGTGAATCATGCTGTGCCCCTGGCTATGAAGCTGATCGGCCCTGGACCAGGGAAATGCAACCTCTAC 
GCAGACAGCGCCTGGGAGAACGTCAAGAATGGCCCCGGACCTGGGAAATCCAAGTATAAGCTCGCTACCTCT 
GTGCTGGCAGGCCTGCTCGGACCAGGCCCCGGACAGACAAATTTCAAAAGCCTGCTCAGAAACCTGGGAGTG 
TCCGAGGGGCCTGGCCCAGGATCTAGCGTCTTTAATGTGGTCAACTCCTCTATTGGGCTCATCATGGGACCC 
GGACCTGGGGTGAAAAATGTCATTGGCCCATTCATGAAGGCCGTGTGTGTCGAAGGACCCGGGCCTGGCATG 
AACTACTATGGAAAGCAAGAAAATTGGTACAGCCTGAAGAAAGGCCCTGGGCCAGGCGGACTGGCTTACAAG 
TTTGTGGTCCGAGGGGCAGCCACTCCCTATGGGCCTGGGCCAGGCCCCGATTCCATCCAGGACTCTCTCAAA 
GAGAGCCGGAAACTGAACGGACCCGGGCCTGGACTGCTCATTTTCCACATCAATGGCAAAATTATCAAGAAC 
AGCGAGGGACCTGGGCCAGGCGCCGGACTGCTGGGGAACGTGTCCACCGTCCTGCTCGGCGGAGTGGGGCCC 
GGCCCTGGGAAGTACAAGATCGCTGGAGGGATCGCAGGCGGACTGGCCCTCCTGGGCCCAGGACCAGGGATG 
CGCAAACTGGCTATTCTCTCTGTCTCCAGCTTTCTGTTTGTGTGA 
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Protein 


Sequence 


Restriction 


HIV gag 386 


VLAEAMSOV 

» i~r\L.r>) iwy v 


HI A-AP 


HIV gag 271 


MTNNPPIPV 


HLA-A2 


HIV pol 774 


MASDFNLPPV 


HLA-A2 


HIV pol 448 


KLVGKLNWA 


HLA-A2 

1 1 1 — n nc 


HIV pol 163 


LVGPTPVNI 


HLA-A2 


HIV pol 498 


ILKEPVHGV 


HI A-AP 

1 ILn nc 


HIV pol 879 


KAACWWAGI 


HLA-A? 

1 1 l_n nC 


HIV pol 132 


KMIGGIGGFI 


HI A -A? 

1 ILn nc 


HIV pol 772 


RAMASDFNL 


HI A-AP 

1 IL_rA nc 


HIV pol 183 


TLNFPISPI 

■ Ul 111 A w I X 


HI A-AP 


HIV env 134 


KLTPLCVTL 


HI A.AP 
nLM "Mc 


HIV env 651 


LLOLTVWGI 


HI A-AP 

lILn* Mc 


HIV env 163 


SLLNATDIAV 


Hl A-AP 

nLn nc 


HIV nef 221 


LTFGWCFKL 


HI A-AP 
nLn nc 


HIV vpr 59 


AIIRIL00L 

n X J. 1 \ X 1 — V-iV^ 1 — 


HI A-AP 
rlLM- Mc 


HIV vpr 62 


RILOOLLFI 

l\X Lv(^LLI X 


HI A A9 
rlLM-Mc 


HIV pol 929 


OMAVFIHNFK 


HI A-A^ 


HIV pol 722 


KVYLAWVPAHK 


HI A-A^ 
nLn "MO 


HIV pol 971 


KIONFRVYYR 

IXXv(ll|| l\V 1 1 1 \ 


HI A-A^ 
nLM- MO 


HIV do! 347 


AIFOSSMTK 


HI A-A^ 
nLM -MO 


HIV pol 98 


VTIKIGGOLK 

v i xrxx\jvjv(L.ix 


HI A-A^ 
nLM -MO 


HIV env 61 


TTLFCASDAK 


HI A-A^ 
nLM- MO 


HIV env 47 


VTVYYGVPVWK 


HI A -AT 
nLM "MO 


HIV nef 100 


OVPLRPMTYK 

x* i Li\n 1 1 iix 


HI A-A^ 
nLM "MO 


HIV vif 7 


VMIVWOVDR 

V 1 1 X V fw\-( V L/l\ 


HI A. A^ 
nLM - MO 


HIV gag 162 


OMVHOAISPR 

v^i i w i lynivi I \ 


HI A-A'} 
nLn MO 


HIV gag 545 


YPLASLRSLF 


HI A-R7 
nLM- D / 


HIV gag 237 


HPVHAGPIA 

■ II 1 1 II 1V«II -LI \ 


HI A-R7 

ncn D / 


HIV pol 186 


FPISPIETV 

1 1 XsJl lU 1 V 


HI A-R7 
ni_M d / 


HIV pol 893 


IPYNP0S0GVV 

* 1 1 1 »l V^^/V^\4 w ¥ 


HI A-R7 

nLn o/ 


HIV env 259 


IPIHYCAPA 

i ^ i i i vni #i 


HI A-R7 


HIV env 250 


CPKVSFEPI 


HLA-B7 

i iLn v / 


HIV nef 94 


FPVRPQVPL 


HLA-R7 

i ii— n ui 


HIV rev 75 


VPLQLPPL 


HLA-B7 

1 ILn U / 


HIV pol 684 


EVNIVTDSQY 


HLA-A1 

1 ILn nx 


HIV gag 317 


FRDYVDRFY 


HLA-A1 

1 ILn nx 


HIV pol 368 


VIYQYMDDLY 


HLA-A1 


HIV pol 295 


VTVLDVGDAY 


HLA-A1 


HIV pol 533 


IYQEPFKNL 


HLA-A24 


HIV pol 244 


PYNTPVFAI 


HLA-A24 


HIV pol 530 


TYQIYQEPF 


HLA-A24 


HIV pol 597 


YWQATVIIPEW 


HLA-A24 


HIV env 681 


IWGCSGKLI 


HLA-A24 


HIV env 671 


RYLKDQQLL 


HLA-A24 
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n w i/C 1 1 1 


oequence 


Restriction 


UTW Qm . CC 

nxv env oo 


VWKEAl 1 ILF 


HLA-A24 


HTV vnr 

1 1 ± v v pi *tU 


lit 1 YbUI W 


III a A O A 

HLA-A24 


HTV vnr 14 

MJ.V V|J| ±f 


DVMCUTI n 
rilMtWILtL 


in a « n a 

HLA-A24 


HTV nan °Qft 
nxv yay Cyo 


1/DlilTTI PI MI/TWDMV 

I\KW1 1 LuLNKI VKM Y 


1 II a nn 

HLA-DR 


HTV nnl ciQfi 
nxv pu i Oi7u 


lilETIT\/MTDDI IJVn 

Wtr VIM 1 rr LVKLWYQ 


I II A r\n 

HLA-DR 


HTV nnl Q^fi 
niv pu i i70D 


ni/n TTV THM CD\/VVD 

UKU1 iNlUNrKVYYK 


1 II A nr\ 

HLA-DR 


HTV nnl 719 
nxv po 1 /l£ 


K V Y LAW V PAHKG I GG 


HLA-DR 


UTV non 0O/1 

niv gay ^y4 


OF" T vi/ni IT T i ni MI/T 

GEIYKRWIILGLNKI 


HLA-DR 


HTV nnl 71 1 

nxv po i /ii 


ri/wvi ai iwn a i n/n to 

EKVYLAWVPAHKGIG 


HLA-DR 


HTV nn\# 79G 

niv env /^y 


AMI 1 r\| T*WI iati/ai n 

QHLLQLTVWGIKQLQ 


HLA-DR 


HTV nan 171 

nxv gag l/l 


QGQMVHQAISPRTLN 


HLA-DR 


UTV r\r\l QQC 

niv pol ooo 


PHA T rAPCUTl/ T i r~r\ 

SPAIFQSSMTKILEP 


HLA-DR 


UT\/ <«k^\/ CCC 

niv env ooo 


IKQFINMWQEVGKAMY 


HLA-DR 


UTV r\r\l OfkO 

HIV pol oUo 


FRKYTAFTIPSINNE 


HLA-DR 


UTW 7co 

HIV pol 758 


HSNWRAMASDFNLPP 


HLA-DR 


utw mc 

HIV pol 915 


KTAVQMAVFIHNFKR 


HLA-DR 


UTW wrvi i 01 

HIV Vpu 31 


YRKILRQRKIDRLID 


HLA-DR3 


UTW vx^l n~7 A 

HIV pol 874 


WAGIKQEFGIPYNPQ 


HLA-DR3 


UTW CIA 

HIV pol 674 


EVNIVTDSQYALGII 


HLA-DR3 


UTW C\t\ 

HIV pol 619 


AETFYVDGAANRETK 


HLA-DR3 


UTW ~i non 

HIV pol 989 


GAVVIQDNSDIKVVP 


HLA-DR3 


UP*/ kicvi 1010 

HLV NS4 1812 


LLFNILGGWV 


HLA-A2 


UPW MCI /CO TOO 

HCV N51/E2 728 


FLLLADARV 


HLA-A2 


ur*w mc a i con 

HCV NS4 1590 


YLVAYQATV 


HLA-A2 


U/**W MCC O^l 1 

HLV Nbo 2611 


RLIVFPDLGV 


HLA-A2 


ur*w rnnr ioo 

HLV LORE 132 


DLMGYIPLV 


HLA-A2 


UPW mc/i inon 
HLV No4 iy2U 


I iinin i t a r* a 

WMNRLIAFA 


HLA-A2 


HLV No4 looo 


VLVGGVLAA 


HLA-A2 


UPV MC>1 1 7£0 
MLV INo4 1/oy 


I J kill JHPTrr»T 

HMWNFISGI 


HLA-A2 


UP\/ MC/ loci 
HLV N54 lobl 


ILAGYGAGV 


HLA-A2 


upv rnDc 
HLV LUKt OO 


\/i I nnnonni 

YLLPRRGPRL 


HLA-A2 


HPV MCI /T9 79A 
HLV lNol/t£ /^o 


1 1 CI 1 1 AHA 

LLFLLLADA 


HLA-A2 


HPV 1 HDC 1 1 Q1 
HLV LUKr llol 


vi WTnuAnw 

YLVTRHADV 


HLA-A2 


UPV phdit ni 
nLV LUKt Ol 


i/TcrncAhn 

KTSERSQPR 


HLA-A3 


upv phdf aq 

nuV LUKt H-O 


RLGVRATRK 


HLA-A3 


HPV FMV1 90H 

hlv tiNvi ^yu 


QLFTFSPRR 


HLA-A3 


HPV MCI /CO coo 
nLV Nol/ tc Oo£ 


RMYVGGVEHR 


HLA-A3 


HPV IMS? IIQfi 


1 TFPUCI/Vk' 
LlrLHol\l\l\ 


III A AO 

HLA-A3 


HCV NS4 1863 


GVAGALVAFK 


HLA-A3 


HCV NS4 1864 


VAGALVAFK 


HLA-A3 


HCV NS3 1262 


LGFGAYMSK 


HLA-A3 


HCV Core 169 


LPGCSFSIF 


HLA-B7 


HCV NS5 2922 


LSAFSLHSY 


HLA-A1 


HCV NS3 1128 


CTCGSSDLY 


HLA-A1 


HCV NS5 2180 


LTDPSHITA 


HLA-A1 
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Protein 


Seauence 


RpQ"h pt n nn 
rvColl ILL lUfl 


HCV Core 126 


1 TPflFAni MAY 
L 1 UUrrtULrlvl I 


Ml A Al 


HCV NS3 1305 


LADGGCSGGAY 


HLA-A1 


HCV NS4 1765 


FWAKHMWNF 

i nrwNi ii ivvivi 


HI A-AP4 


HCV NS5 2875 


RMILMTHFF 

1 \l 1 X U-l 1 1 1 1 1 1 


HI A-AP4 


HCV NS5 2639 


VMGSSYGF 

■ 1 IM w w | VII 


HI A-AP4 


HCV NS4 1765 


FWAKHMWNFI 

i »»nixi ii in lit x 


HI A-APA 

rlLM r\c.'-r 


P. falciparum SSP2-230 


FMKAVCVEV 

1 IHviVvVL.V 


HI A-A9 


P. falciparum EXP1-83 


GLLGVVSTV 

vJ 1— L.VJ V VO 1 V 


HI A.A9 


P. falciparum CSP-7 


ILSVSSFLFV 


HI A-A9 


P. falciDarum LSA1-94 

■ • • • >^ • Mill ks/ft^ I 


OTNFKSI 1 R 


HI A-A^ 


P, falciparum LSA1-105 

■ • » >4 » Nrf 1 ^Ml Mill UVDJ. X V \J 


GVSFNTFI \C 


Ml A.A^ 
FILM -Mo 


P. falcioarum SSP2-522 




Ml A A^ 


P. falciparum SSP2-539 

* • ■ • w • |^V4I Mill VSSJt fc— \J\J*S 


TPYAfiFPAPF 


Ml A R7 


P. falcioarum LSA1-1663 


1 PSFNFRGY 

L.I OCIMClxvl I 


Ml A- Al 


P. falciparum EXP1-73 

■ • 1 1 » I^V«I Mill ^/\l JL / 




Ml A A9/I 


P. falcioarum CSP-12 

■ • l 1 1 VI 1 Mill Wwl X- L_ 


QFI FVFAI F 
or Lr V urM-r 


Ml A AOA 
rlLM- A^^f 


P. falciparum LSA1-10 

■ « 1 M 1 V 1 iyui Mill UV/lX .X V 


YFTI VNI 1 T 

1 riL.VllL.Ll 


Ml A hOA 


P. falcioarum SSPP-14 

1 • 1 U 1 V* 1 |m/VI( Mill MWT C X 1^ 


r li rruLrLv 


Ul A AO 
MLA-A^i 


P falcioarum EXPl-8n 


V LMULLUVV 


Ml A AO 

nLA-A^ 


P. falcioarum EXP1-91 

• • 1 M 1 Vj» 1 yS M 1 Mill LAI X Z/ X 


vi i nnvni VI 

V LLUUVuLV L 


Ml A AO 
nLA- Aci 


P. falcioarum SSP2-5?^ 

■ • • M 1 V» 1 L/UI Mill \/<jrL sj C-\J 


1 APAGI AYk' 


Ml A AQ 

nLA-Ao 


P. falcioarum EXP1-10 

« • I U I V ipui 141 II LAI X XU 


Al FFTTFMk' 
MLr r 1 l r INix 


Ml A AQ 
nLA- Ao 


P. falciDarum LSA1-11 

• « ■ W 1 w 1 L/UI Mill LJnJ. XX 


FTI VMI 1 TFH 
r iLviiLLirn 


Ml A AQ 

nLA- Ao 


P. falciDarum SSP2-l?fi 

• • 1 M 1 \^ 1 UUI Ml 1 1 sJsJl C— XC.V 


1 PYfiRTNl 

Lr I Ur\ 1 IML 


Ml A R7 
nLA- Of 


P. falciDarum CSP-15 

• • 1 M 1 W 1 L^MI Mill V/%/| X. v/ 


FVFAI FOFY 


Ml A Al 
MLA-Al 


P. falciparum LSA1-1794 

■ • ' ^* • 1 L/U 1 Mill UVI \X X / 1 


FDDFFNTfiTY 

rv{L/LLIlldl 1 


Ml A Al 
nLA-Al 


P. falciparum LSA1-9 

■ • • m • i u>u i miii uvnx «y 


FYFTI VNI 1 

1 1 r 1L V liLL 


Ml A ASM 


P. falciparum SSP2-8 

■ • 1 M 1 W 1 p/UI Mill Owl C- W 


KYI VTVFI T 

M LVl V ILI 


Ml A AOA 
rlLA-Acn 


P. falciparum CSP-394 


Gl TMVI SFI 

UL1I IV LOT L 


Ml A.A9 
nLA- Ac 


P. falciparum EXP1-2 

■ • . » v ■ l/ui Mill X> £— 


KTI ^VFFI A 


Ml A A9 
nLA- t\c. 


P. falciparum CSP-344 

■ • • 1 W • y*r V4 1 Mill WS/I M 1 I 


VTPGNfSTOVR 


Ml A.AQ 
nLA- AO 


P. falciparum LSA1-59 


HVL SHNSYFK 


HI A.A^ 
nLM" MO 


P. falciparum SSP2-207 


PSDGKCNLY 


HI A-A1 


P. falciparum LSA1-1671 


YYIPH0SSL 


HI A-APA 


P. falciparum LSA1-1876 


KFIKSLFHIF 

1X1 X 1 VsJ L_ 1 1 IX 1 


HI A- AP4 
rlLn*nt.*T 


P. falciparum SSP2-13 


VFLIFFDLFL 

VI LXI | \J L- 1 l_ 


HI A-APA 


P. falciparum LSA1-1881 


LFHIFDGDNEI 

f— 1 1 111 \J\AW\ 1 L_ X 


HI A- APA 


P. falciparum CSP-55 


YYGK0ENWYSL 

1 ■ vllW^LIlM I JL 


HI A-AP4 


P. falciparum LSA1-5 


LYISFYFI 

t— 1 X \J 1 1 1 ± 


HI A-APA 

nLrA r\t*t 


P. falciparum CSP-2 


MRKLAILSVSSFLFV 

III M \ Uv ¥ Jvl L_ 1 V 


HLA-DR 


P. falciparum CSP-53 


MNYYGKQENWYSLKK 


HLA-DR 


P. falciparum CSP-375 


SSVFNVVNSSIGLIM 


HLA-DR 


P. falciparum SSP2-61 


RHNWVNHAVPLAMKLI 


HLA-DR 


P. falciparum SSP2-165 


PDSIQDSLKESRKLN 


HLA-DR3 


P. falciparum SSP2-211 


KCNLYADSAWENVKN 


HLA-DR3 
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Protein 


Sequence 


Restriction 


P. falciDarum SSP2-223 

■ * * *>» ■ v 1 l/VII Mill WV/I 1— LbV 


VKNVIGPFMKAVPVF 


Ml A. no 
rlLM" UK 


P. falciparum SSP2-509 


KYKIAGGIAGGLALL 

i » i xrivivi invivi LfiUL 


HLA-DR 

nun uw 


P. falciparum SSP2-527 


GLAYKFVVPGAATPY 

«U» 1 Ixl ¥ ¥ 1 \JUu \ II 1 


HI A-DR 

MUrA UW 


P. falciparum EXP1-71 


KSKYKLATSVLAGLL 

« Wl X | INI— f \ 1 VI LJtVJUU 


HLA-DR 


P. falciparum EXP1-82 


AGLLGNVSTVLLGGV 


HI A-I1R 
riL/A * uw 


P. falciparum LSA1-16 


LLIFHINGKIIKNSF 

L.L-XI 1 1 X 1 1VIIX X X l\l Wl_ 


HI A-HR 

MUM UW 


P. falciparum LSA1-94 


OTNFKSLLRNL GVSF 

Vfllll IX^JLLIM 1 LVJ V JL 


HI A-HR 
nLM" uw 


HBV core 18 


FLPSDFFPSV 


HI A- A? 
nLM -Ml 


HBV env 183 


FLLTRILTI 

1 LL 1 l\x L. 1 1 


HI A- AO 
nLM-ML 


HBV env 335 


Wl SI 1 VPFV 


HI A-AO 
nLA- Ac 


HBV doI 455 


GLSRYVARI 

\jloi\i vn[\L 


HI A. AO 


HBV doI 538 


YMnnvvi gv 

1 1 \UU v V LUIV 


Ul A A9 / A 1 


HBV do! 773 


Tl RGTSFVYV 
x lfwj i or v i v 


HI A A9 


HBV doI 562 

» I l— ' ¥ la/ W 1 \J \Jtmm 


Fl 1 SI GTHI 

i LLOLulnL 


Ul A AO 


HBV doI 642 


Al MPI YAPT 

MLur L IMUx 


141 A AO 


HBV env 338 

1 1 \-J ¥ V 1 1 V \J\J\J 


fil SPTUWI 
ULOr 1 V WLO V 


Ul A AO 


HBV core 141 


STI PFTTVVRR 

O 1 LrC 1 1 V V i\r\ 


HI A A^ 
nLA-Ao 


HBV doI 149 


HTI IaiYAGTI Yk" 
n i lwinMu x l t is. 


Ul A A*3 / A 1 

MLA-AJ/A1 


HBV doI 150 


Tl UK'AftTI Ytf 


Ul A AO 

HLA-Ao 


HBV do! 388 


lv vuroi^roiA 


Ul A AO 
nLA-Ao 


HBV doI 47 


IWSTPWTHk 
n vjirH l nix 


Ul A AO 

nLA-Ao 


HBV doI 531 


OnlV/J V V r\r\ 


Ul A AO 
nLA-Ao 


HBV doI 629 


KVRNIFTfil Y 


HI A-AO/A1 
nLA-Ao/ Al 


HBV doI 665 


OAFTFSPTYK 


HI A -AO 
nLA-Ao 


HBV core 19 


LPSDFFPSV 
Lr%;ur r ro v 


HI A-R7 
flLM* D / 


HBV env 313 


IPIPSSWAF 


HI A-R7 
nLr\- D/ 


HBV doI 354 


TPARVTGGVF 


HI A-R7 
nLM- D/ 


TB 


RMSRVTTFTV 

l\l V III IV 


HI A-A9 


TB 


ALVLLML PVV 

/VI— V L.L.I II— 1 V V 


Hl A-A9 


TB 


LMIGTAAAVV 

li i ivj i nnn v v 


HI A.A9 


TB 


ALVLLMLPV 


HI A- A? 
nLM ■ t\c. 


TB 


GLMTAVYLV 

MLI 1 In* 1 LV 


HI A- A? 

nLM Ml 


TB 


MALLRLPV 


HI A-AP 

nLM Ml 


TB 


RMFAANLGV 


HI A-AP 

nLM Ml 


TB 


SLYFGGICV 


HI A.AP 
nLM " Ml 


TB 


RLPLVLPAV 

l\l_l LV LI nv 


HI A-AP 

nLM " Ml 


TB 


RLMIGTAAA 

• \ i— 1 1 x \j i rwi 


HI A- A? 
n lm Ml 


TB 


FVVALIPLV 


HLA-A2 

1 1 LrA r\C 


TB 


MTYAAPLFV 


HLA-A2 


TB 


AMALLRLPLV 


HLA-A2 


p53 139 


KLCPVQLWV 


HLA-A2 


CEA 687 


ATVGIMIGV 


HLA-A2 


CEA 691 


IMIGHLVGV 


HLA-A2 


Her2/neu 689 


RLLQETELV 


HLA-A2 


MAGE3 112 


KVAEIVHFL 


HLA-A2 
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Protein 


Sequence 


Restriction 


Her2/neu 665 


VVLGVVFGI 


HLA-A2 


p53 149 


SMPPPGTRV 


HLA-A2 


PAP.21.T2 


LTFFWLDRSV 


HLA-A2 


PAP. 112 


TLMSAMTNL 


HLA-A2 


PAP. 284 


IMYSAHDTTV 


HLA-A2 


PSM.288.V10 


GLPSIPVHPV 


HLA-A2 


PSM.441 


LLQERGVAYI 


HLA-A2 


PSM.469L2 


LLYSLVHNL 


HLA-A2 


PSM.663 


MMNDQLMFL 


HLA-A2 


PSA.3.V11 


FLTLSVTWIGV 


HLA-A2 


PSA. 143. V8 


ALGTTCYV 


HLA-A2 


PSA. 161 


FLTPKKLQCV 


HLA-A2 


HUK2.4.L2 


LLLSIALSV 


HLA-A2 


HuK2.53.Vll 


VLVHPQWVLTV 


HLA-A2 


HUK2.165 


FLRPRSLQCV 


HLA-A2 


HuK2.216.Vll 


PLVCNGVLQGV 


HLA-A2 
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ID# 








HLA 


D^Af Art 1 r\ f\ 

rTototype 


VDM 1 


epitope 


Sequence 


Conservation 


restriction 


binding 


XKN 


924.07 


core 18 


FLPSDFFPSV 


45 


A2 


3.5 


5 


777.03 


env 183 


FLLTRILTI 


80 


A2 


9.8 


4 


1013.01 


env 335 


WLSLLVPFV 


100 


A2 


5.4 


4 i 


1 1 kr n? 


r\f\\ A^\^\ 
pUI *tJJ 


P| CRYVAR! 
wLDix 1 VnTxL 


jj 


A9 


oo.y 


0 


1090.77 


doI 538 


YMDDWLGV 


90 


A2/A1 


64 

W.I 


5 

w 


927 11 


doI 562 

WWI uvL 


FLLSLGIHL 


95 


A2 


78 


w 


927.15 


pol 642 


ALMPLYACI 


95 


A2 


12.9 


4 


1083 01 


rorp 141 

WWl V> 1 1 1 


STLPETTWRR 

J 1 LI L 1 1 t VIM\ 


95 


A1/A1 1 


7TS/4 *S 


4 


1147 16 


doI 149 


HTLWKAGILYK 


100 


A3/A1 

r\\j 1 n 1 


15 4/15 6 




1069 15 
i wis, i \j 


ool 150 


TI WKAGILYK 


100 

1 WW 


A3/A1 1 

t\\jf r\ 1 1 


? 1/^ 




1069 20 


ool 388 


LWDFSOFSR 


100 

1 WW 


AVA11 


RR75/17 


7 5 


1069.16 


pol 47 


NVSIPWTHK 


100 


A3/A11 


174/117 


3 ! 


1090.11 


pol 531 


SAICSWRR 


95 


A3/A1 1 


2189/29 


3 


1142.05 


pol 629 


KVGNFTGLY 


95 


A3/A1 


58/365 


2 


1090.10 


ool 665 


QAFTFSPTYK 




A3/A1 1 


249/8 


3 


988.05 


core 1 9 


LPSOFFPSV 


45 


B7 


30268 


4 


1145.04 


env 313 


IPIPSSWAF 


100 


B7 


42.3 


4 


1147.04 


pol 354 


TPARVTGGVF 


90 


B7 


13.2 


2 


[ Tf4"7.0"2 


j)ol 429 


HPAAMPHLL 


100 


B7 


56.6 


4 


> 1039.06 


env 359 


VyMMWYWGPSLY 


85" 


A1 


16.3 


3 


1448.01 


core 419 


DLLDTASALY 


75 


A1 


2.3 


3 


1373.88 


core 137 


LTFGRETVLEY 


75 


A1 


80.0 


3 


1090.07 


pol 415 


LSLDVSAAFY 


95 


A1 


6.0 


3 


20.0271 


pol 392 


SWPKFAVPNL 


95 


A24 


2.1 


2" 


1373.56 


env 332 


RFSWLSLLVPF 


100 


A24 


12.0 


2 


1373.07 


core 117 


EYLVSFGVW 


90 


A24 


16.0 


2 


1069.23 


pol 745 


KYTSFPWLL 


85 


A24 


1.0 


3 



XRN = Cross binding, number of HLA types in the supertype panel of 5 for which significant binding as detected 
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HBVi2 

MGMQVQIQSLFLLLLWVPGSRGHTLWKAGILYKAKFVMWTLKAAAFLPSDFFPSVNFLLSLGIHLYMDDVVLGVGLS 
RYVARLFLLTRI LTISTLPETTVVRRQAFTFSPTYKGAAAWLSLLVPFVN I P I PSSWAFKTPARVTGGVFKVGNFTGL 
YNLPSDFFPSVKTLWKAGI LYKNVS I PWTHKGAALVVDFSQFSRNSAICSVVRRALMPLYAC I 

ATGGGAATGCAGGTGCAGATCCAGAGCCTGTTTCTGCTGCTCCTGTGGGTGCCCGGGTCCAGAGGACACACCCTGTGG 
MGGCCGGMTCCTGTATMGGCCMGTTCGTGGCTGCCTGGACCCTGMGGCTGCCGCTTTCCTGCCTAGCGATTTC 
TTTCCTAGCGTGAACTTCCTGCTGTCCCTGGGAATCCACCTGTATATGGATGACGTGGTGCTGGGAGTGGGACTGTCC 
AGGTACGTGGCTAGGCTGTTCCTGCTGACCAGAATCCTGACCATCTCCACCCTGCCAGAGACCACCGTGGTGAGGAGG 
CAGGCC7TCACCT7TAGCCCTACCTATAAGGGAGCCGCTGCCTGGCTGAGCCTGCTGGTGCCCTTTGTGAATATCCCT 
ATCCCTAGCTCCTGGGCTTTCMGACCCCAGCCAGGGTGACCGGAGGAGTGTTTAAGGTGGGAAACTTCACCGGCCTG 
TATMCCTGCCCAGCGATTTCTTTCCTAGCGTGAAGACCCTGTGGAAGGCCGGAATCCTGTACAAGAATGTGTCCATC 
CCTTGGACCCACMGGGAGCCGCTCTGGTGGTGGACTTTTCCCAGTTCAGCAGAAATTCCGCTATCTGCTCCGTGGTG 
AGGAGAGCTCTGATGCCACTGTATGCCTGTATCTGA 



FIG.20D 



HBV-2A 



MGMQVQIQSLFLLLLWVPGSRGHTLWKAGILYKAKFVAAWTLKAAAFLPSDFFPSVNFLLSLGIHLYMDDVVLGVGLS 
RYVARLFLLTRI LTISTLPETTVVRRQAFTFSPTYKGAAAWLSLLVPFVN I P I PSSWAFKTPARVTGGVFKVGNFTGL 
YNLPSDFFPSVKTLWKAGILYKNVSIPWTHKGAALVVDFSQFSRNSAICSVVRRKAWMMWYWGPSLYKKYTSFPWLLN 
AHPAAMPHLLKAAADLLDTASALYNAAARFSWLSLLVPFNAASWPKFAVPNLKLTFGRETVLEYKALSLDVSAAFYGA 
AEYLVSFGVWGAALMPLYAC I 

ATGGGAATGCAGGTGCAGATCCAGAGCCTGTTTCTGCTCCTCCTGTGGGTGCCCGGGTCCAGAGGACACACCCTGTGG 
MGGCCGGAATCCTGTATMGGCCMGTTCGTGGCTGCCTGGACCCTGAAGGCTGCCGCTTTCCTGCCTAGCGATTTC 
TTTCCTAGCGTGAACTTCCTGCTGTCCCTGGGAATCCACCTGTATATGGATGACGTGGTGCTGGGAGTGGGACTGTCC 
AGGTACGTGGCTAGGCTGTTCCTGCTGACCAGAATCCTGACCATCTCCACCCTGCCAGAGACCACCGTGGTGAGGAGG 
CAGGCCTTCACCTTTAGCCCTACCTATAAGGGAGCCGCTGCCTGGCTGAGCCTGCTGGTGCCCTTTGTGAATATCCCT 
ATCCCTAGCTCCTGGGCTTTCAAGACCCCAGCCAGGGTGACCGGAGGAGTGTTTAAGGTGGGAAACTTCACCGGCCTG 
TATMCCTGCCCAGCGATTTCTTTCCTAGCGTGAAGACCCTGTGGAAGGCCGGAATCCTGTACAAGAATGTGTCCATC 
CCHGGACCCACMGGGAGCCGCTCTGGTGGTGGACTTTTCCCAGTTCAGCAGAAATAGCGCCATCTGTTCGGTCGTG 
AGAAGGAAAGCCTGGATGATGTGGTACTGGGGTCCTAGTCTGTATAAGAAGTACACCTCATTCCCATGGCTCTTGAAT 
GCCCATCCCGCTGCAATGCCACACCTGCTTAAAGCTGCGGCGGATCTGCTGGACACAGCCTCAGCTTTATATAATGCT 
GCAGCMGATTCTCCTGGTTGTCTCTCTTAGTGCCCTTCMCGCAGCTTCCTGGCCAAMTTTGCCGTTCCGAACCTG 
MGCTCACTTTTGGAAGAGAGACAGTACTTGAATACAAAGCACTAAGCCTTGACGTGTCAGCAGCCTTCTACGGAGCA 
GCAGMTATCTAGTATCTTTTGGGGTCTGGGGCGCAGCCCTCATGCCTCTATACGCCTGCATTTGA 



FIG.20E 
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■ HBV-2B 

MGMQVQIQSLFLLLLWVPGSRGHTLWKAGILYKAKFVMWTLKAMFLPSDFFPSVNFLLSLGIHLYMDDVVL 
GVGLSRYVARLFLLTRI LTISTLPETTVVRRQAFTFSPTYKGAAAWLSLLVPFVN I P I PSSWAFKTPARVTGG 
VFKVGNFTGLYNLPSDFFPSVKTLWKAGILYKNVSIPWTHKGAALVVDFSQFSRNSAICSVVRRKEYLVSFGV 
WGLSLDVSAAFYNAAAKYTSFPWLLNAHPAAMPHLLKAAADLLDTASALYNSWPKFAVPNLKLTFGRETVLEY 
KAAWMMWYWGPSLYKAAARFSWLSLLVPFGAAALMPLYACI 

ATGGGAATGCAGGTGCAGATCCAGAGCCTGTTTCTGCTCCTCCTGTGGGTGCCCGGGTCCAGAGGACACACCC 
TGTGGAAGGCCGGAATCCTGTATAAGGCCAAGTTCGTGGCTGCCTGGACCCTGAAGGCTGCCGCTTTCCTGCC 
TAGCGATTTCTTTCCTAGCGTGAACTTCCTGCTGTCCCTGGGAATCCACCTGTATATGGATGACGTGGTGCTG 
GGAGTGGGACTGTCCAGGTACGTGGCTAGGCTGTTCCTGCTGACCAGAATCCTGACCATCTCCACCCTGCCAG 
AGACCACCGTGGTGAGGAGGCAGGCCTTCACCTTTAGCCCTACCTATAAGGGAGCCGCTGCCTGGCTGAGCCT 
GCTGGTGCCCTTTGTGAATATCCCTATCCCTAGCTCCTGGGCTTTCAAGACCCCAGCCAGGGTGACCGGAGGA 
GTGTTTMGGTGGGAMCTTCACCGGCCTGTATMCCTGCCCAGCGAnTCTTTCCTAGCGTGAAGACCCTGT 
GGAAGGCCGGAATCCTGTACAAGAATGTGTCCATCCCTTGGACCCACAAGGGAGCCGCTCTGGTGGTGGACTT 
TTCCCAGTTCAGCAGAMTTCAGCMTTTGTTCGGTGGTGAGMGAMGGMTATCTTGTTTCATTTGGCGTC 
TGGGGGCTGTCACTGGATGTMGTGCGGCATTTTACMTGCCGCCGCAAAATATACAAGCTTCCCATGGCTCC 
TAAACGCACACCCAGCTGCAATGCCGCATCTACTGAAAGCAGCCGCTGACCTCTTAGACACTGCCTCCGCTCT 
GTACMCTCTTGGCCCMGTTTGCCGTGCCTAATCTCAAGTTGACCTTCGGTAGAGAGACAGTCTTAGAATAC 
AAAGCGGCCTGGATGATGTGGTACTGGGGACCCTCTCTGTATAAAGCCGCTGCAAGGTTCTCCTGGCTTAGCC 
TTCTCGTACCATTCGGAGCAGCTGCCCTAATGCCTTTGTACGCATGCATCTGA 



FIG.20F 
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HBV-21A 

MGMQVQIQSLFLLLLWVPGSRGSWPKFAVPNLKAAM^ 

WKAGILYKKAFLLTRILTIGALSLDVSMFYNAMKYTSFPWLLNAAARFSWLSLLVPFNAATPARVTGGVFKAAEYL 
VSFGVWGAMYMDDVVLGVNDLLDTASALYNAMFPHCUFSYMKAMWMMWYWGPSLYI<^ASAICSVVRRKNFLLSL 
GiHLNIPIPSSWAFKAAWLSLLVPFVNAFLPSDFFPSVKLTFGRETVLEYKQAFTFSPTYK 

ATGGGMTGCAGGTGCAGATCCAGAGCCTGTTTCTGCTCCTCCTGTGGGTGCCCGGGTCCAGAGGATCTTGGCCTAAA 

TTCGCAGTGCCAAACCTTAAAGCCGCGGCTGCTAAGTTCGTAGCTGCCTGGACACTAAAGGCCGCCGCTAAGAGCACA 

CTGCCAGAGACCACCGTGGTCCGGCGAAAGCATCCAGCCGCAATGCCCCACTTGCTCAAAGCAGCCGCCCACACTCTT 

TGGMGGCTGGGATATTGTACMGAAAGCCTTCCTTCTGACCAGGATATTAACTATCGGAGCTeTGTCACtCGACGTT 

TCTGCTGCCTTCTACMCGCGGCGGCAAMTACACTAGCT7TCCATGGCTACTCMCGCAGCCGCCAGATTTTCTTGG 

CTATCACTACTGGTGCCATTTMTGCAGCMCACCTGCTAGAGTGACTGGCGGCGTCTTTAAAGCAGCCGAGTACTTG 

GTGAGCTTTGGCGTCTGGGGTGCAGCGGCATATATGGATGATGTAGTGTTAGGGGTGAACGACCTCCTGGACACAGCC 

AGTGCGCTGTACMTGCAGCTGCATrCCCGCATTGCCTAGCCTTCAGTTATATGAAAGCAGCAGCCTGGATGATGTGG 

TACTGGGGACCGTCCCTTTATAMGCAGCTTCAGCMTCTGTTCCGTTGTGAGGAGAAAAMCTTTTTACTCT^ 

GGTATTCACCTGMCATTCCC ATCCCT TCCTCATGGGCATTCAMGCCGCTTGGCTGAGTCTACTCGTACCTTTCGTT 

MTGCATTTCTGCCCAGCGACTTTTTCCCCTCGGTAAMCTGACATTCGGACGCGAMCAGTCCTTGAATATAAGCAG 
GCCTTCACGTTCTCACCAACCTATAAATGA 



FIG.21D 



HBV-21B 



MGMQVQIQSLFLLLLWVPGSRGYMDDVVLGVNAAAEYLVSFGVWNDLLDTASALYGAAHTLWKAGILYKKAFLPSDFF 
PSVKAFPHCLAFSYMKMRFSWLSLLVPFNAASWPKFAVPNLKAAAQAFTFSPTYKNAAASAICSVVRRKAFLLTRIL 
TINIPIPSSWAFKMWMMWYWGPSLYKAMTPARVTGGVFKMNFLLSLGIHLNLTFGRETVLEYKHPMMPHLL^ 
STLPETTVVRRKWLSLLVPFVNAAAAKFVAAWTLKAAAKLSLDVSAAFYNAAAKYTSFPWLL 

ATGGGAATGCAGGTGCAGATCCAGAGCCTGTTTCTGCTCCTCCTGTGGGTGCCCGGGTCCAGAGGATACATGGATGAC 

GTTGTGTTAGGCGTTAATGCAGCCGCAGAATATCTCGTGTCATTCGGCGTCTGGAACGACCTGTTGGACACTGCATCT 

GCTCTGTACGGTGCAGCCCATACCCTGTGGMGGCCGGMTCCTCTACAAAMGGCATTCCTACCTAGCGACTTTTTT 

CCTTCAGTGAMGCCTTCCCACATTGCCTAGCATTCTCGTATATGAAAGCGGCTAGG7TCTCATGGCTTAGTCTTCTA 

GTACCTTTCMTGCCGCCTCCTGGCCCAMTTCGCCGTACCAMTCTAAMGCGGCCGCGCAGGCCTTTACATTCTCT 

CCGACTTATAAAMTGCAGCAGCCTCCGCTATTTGTAGCGTCGTGCGCCGAMGGCCTTCCTGCTMCCCGGATTTTG 

ACGATAAACATCCCCATCCCTTCTAGCTGGGCTTTCAAAGCAGCATGGATGATGTGGTACTGGGGTCCCAGCTTATAC 

AAAGCTGCGGCAACCCCAGCAAGAGTGACAGGGGGCGTGTTTAAGGCCGCCAACTTCCTCCTGAGTCTCGGAATACAC 

CTGMCTTAACCTTTGGGAGAGAGACAGTACTGGAGTATAAACACCCAGCAGCTATGCCGCACCTACTCAAAGCCGCT 

TCMCACTCCCAGAMCMCTGTAGTGAGGAGAAAATGGCTCTCCCTGCTTGTCCCATTTGTCAACGCCGCCGCCGCT 

MGTTTGTGGCCGCTTGGACACTTMGGCTGCAGCAMGTTGTCACTTGATGTTAGTGCAGCGTTCTATAACGCAGCT 
GCAAAATACACTTCCTTTCCCTGGCTGCTGTGA 



FIG.21E 
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HBV-30B 

MGMQVQIQSLFLLLLWVPGSR6FLLTRILTINAAASWPKFAVPNLKAAAHTLWKAGILYKKADLLDTASALYNQAFTFS 

PTYKGAAAN VS I PWTHKGAAAFLLSLG I HLN I P I PSSWAFKAAALWFHI SCLTFKAAAI LLLC L I FLLNAAAYPALMPL 

YACINAHPAAMPHLLKAMSFCGSPYKMGLSRYVARLNKYTSFPWLLNFLPSDFFPSVKAFPH 

GVWNMLTFGRETVLEYKAMLPSDFFPSVKAYMDDVVLGVNLVVDFSQFSRNAMRWMCLRRFII^ 

FNAATPARVTGGVFKMWLSLLVPFVNSAICSVVRRKA^ 

LDVSAAFY 

SS^tgcaggtcca^tac^^^ 

GCATCCTGACMTTMCGCCGCAGCCTCCTGGCCAAAATTTGCCGTGCCAAATCTCAAGGCAGCTGCACACACACTATG 
GAMGCAGGGATACTGTACMGAMGCCGATCTGCTAGACACAGCGTCTGCGTTGTACMCCAGGCTTTTACTTTCTCT 
£S^ A I ATAAAGGCGCAGCTGCAAACGT ^ GTATCCC ™ 

I§ GGCA ICCATCTAMTATCCCTATTCCmATCCTGGGCAmAMGCAGCCGCCmTGGnCCAC 
GACCTTCAMGCCGCAGCMTCCTGCTCCTnGCCTCATmCTTACT^ 

TTGTACGCATGTATCMCGCCCACCCCGCAGCMTGCCCCACCTCCTTAAAGCTGCCGCCAGTTTCTGCGGTTCTCCTT 

ATAMGCAGCAGGGCTGTCCA^TACGTAGCTAGGCTAMCMGTATACCAGCTTCCCCTGGmCTTMTTTCCTGC^ 

GTCAGATTTCTTTCCATCAGTTMGGCCTTCCCTCATTGTCTGGCCTTTAGCTACATGMGGCTGMTATT^ 

TTCGGCGTGTGGAATGCGGCACTGACATTTGGAAGGGAGACAGTGCTCGAGTACAAAGCCGCCGCACTACCCTCGGACT 

TCTTCCCATCGGTCAMGCTTACATGGACGATGTAGTCCTCGGCGTTMCTTAGTAGTGGACTTTTCTCMTTTTCCAG 

AMCGCAGCGGCCAGATGGATGTGCCTTCGGCGTTTTATAATAAACGCCGCTCGATTCAGCTGGCTATCACTCCTAGTT 

CCATTTMTGCAGCTACACCCGCACGGGTGACAGGTGGAGTTTTCMGGCAGCGTGGCTTTCACTGC7TGTGCCATTTG 

TGMCTCAGCTATTTGCTCAGTAGTGAGAAGGAAGGCAAAATTCGTCGCTGCCTGGACTCTCAAAGCTGCCGCAAAGTG 

FIG.22D 



HBV-30C 

MGMQVQIQSLFLLLLWVPGSRGFLLSLGIHLNAAAKYTSFPWLLNAAARFSWLSLLVPFNAAFPHCLAFSYMKAALVVD 
FSQFSRGAILLLCLIFLLNAAAHTLWKAGILYKKAWMMWYWGPSLYKAYPALMPLYACIGAAAWLSLLVPFVNFLLTRI 
LTINIPIPSSWAFKAAAEYLVSFGVWNLPSDFFPSVKFLPSDFFPSVKDLLDTASALYNSWPKFAVPNLKAAASAICSV 
VRPJCLSLDVSMFYNAMKFVMlfrLKAMKMNVSIPWTHKGMGLSRYVARLNAMSTLPETTVVRRKHPAAMPHLL 
KAAARWMCLRRFIINASFCGSPYKMYMDDVVLGVNALWFHISCLT^ 

l I rbr I YK 

ATragGAAinBC^CT 

TGGGCATCCACCTAMTGCTGCTGCAAMTACACATCTTTTCCTTGGCTCCTTMTGCCGCCGCTAGGTTTTCATGGCT 

GAGTCTGCTAGTACCTTTCMTGCGGCTTTCCCACATTGCCTAGCTTTTAGCTATATGAMGCTGCTl^ 

TTTTCACAGTTTAGCAGAGGAGCMTCCTGCTGCTATGTCTGATATTCCTTCTAMCGCAGCAGCCCACACACTCT^ 

AAGCTGGTATCCTTTACAAGAAAGCCTGGATGATGTGGTATTGGGGACCCAGCCTCTACAAAGCATACCCTGCCCTGAT 
GCCACTATACGCATGCATTGGCGCGGCAGC^^ 

CTGACGATTMTATTCCGATCCCMGmCTGGGCATTCAMGCAGCCGCGGAGTATCTGGTTTCAm^ 

ACCTGCCMGCGACTTCmCCnCTGTTMGTTCCTCCCCTCCGAmCTTTCCATCGGTGAMGA^ 

CGCGAGCGCTCTGTACMCTCGTGGCCAAMTTCGCAGTTCCAAACCTAAAAGCCGCCGCCAGTGCCATTTGTTCCGTG 

55T^ GCCGCCGCCTCAACACTGCCTGAGACTACTGTCGTGA ^ 

AMGCATCCGCAC^^ 
ACATGGAC^TGTG^ 

TTCACAT^^ 

FIG.22E 
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HBV-CL 

MQVQIQSLFLLLLWVPGSRGFLLSLGIHLNAMKYTSFPWLLNAAARFSWLSLLVPFNAAFPHCLAFSYMKA 
ALVVDFSQFSRGAILLLCLIFLLNAAAHTLWKAGILYKKAWMMWYWGPSLYKAYPALMPLYACIGAAAWLSL 
LVPFVNFLLTRI LTI NAAAI P I PSSWAFKAAAEYLVSFGVWNLPSDFFPS VKAAAFLPSDFFPS VKAAADLL 
DTASALYNSWPKFAVPNLKAAASAICSVVRRKLSLDVSAAFYNAAAKFVAAWTLKAAAKAANVS I PWTHKGA 
AGLSRYVARLNAAASTLPETTVVRRKHPAAMPHLLKAAARWMCLRRF I I NASFCGSPYKAAYMDDVVLGVNA 
LWFHISCLTFKAAATPARVTGGVFKAAALTFGRETVLEYKQAFTFSPTYK 

ATGGGMTGCAGGTGCAMTACAGTCTCTCTTCCTTTTGCTTCTCTGGGTTCCAGGATCACGGGGCTTCTTG 
CTTAGCTTGGGCATCCACCTAAATGCTGCTGCAAMTACACATCTTTTCCTTGGCTCCTTAATGCCGCCGCT 
AGGTTTTCATGGCTGAGTCTGCTAGTACCTTTCMTGCGGCTTTCCCACATTGCCTAGCTTTTAGCTATATG 
AMGCTGCTTTAGTCGTGGACTTTTCACAGTTTAGCAGAGGAGCMTCCTGCTGCTATGTCTGATATTCCTT 
CTAAACGCAGCAGCCCACACACTCTGGAAAGCTGGTATCCTTTACAAGAAAGCCTGGATGATGTGGTATTGG 
GGACCCAGCCTCTACAAAGCATACCCTGCCCTGATGCCACTATACGCATGCATTGGCGCGGCAGCCTGGTTA 
TCCCTTTTAGTACCGTTTGTCMCTTTCTATTAACCAGAATCCTGACGATTAATGCTGCCGCCATTCCGATC 
CCMGTTCCTGGGCATTCAMGCAGCCGCGGAGTATCTGGTTTCATTTGGCGTATGGAACCTGCCAAGCGAC 
TTCTTTCCTTCTGTTMGGCCGCTGCTTTCCTCCCCTCCGATTTCTTTCCATCGGTGAAAGCCGCTGCCGAC 
CTCCTTGATACCGCGAGCGCTCTGTACAACTCGTGGCCAAAATTCGCAGTTCCAAACCTAAAAGCCGCCGCC 
AGTGCCATTTGTTCCGTGGTMGGAGAAMTTATCACTCGACGTGTCCGCAGCATTTTATAACGCTGCTGCA 
AAGTTTGTCGCAGCATGGACATTGAAGGCTGCAGCGAAAGCAGCAAATGTATCAATACCCTGGACCCACAAG 
GGTGCAGCCGGGCTGTCTAGGTATGTGGCGAGGCTAAACGCCGCCGCCTCAACACTGCCTGAGACTACTGTC 
GTGAGACGCAAACACCCTGCCGCAATGCCCCACCTGCTGAAAGCAGCCGCACGATGGATGTGCCTCAGAAGA 
TTCATMTAMCGCTTCTTTCTGTGGGTCACCCTACAAAGCCGCTTACATGGACGATGTGGTCCTCGGAGTG 
AATGCCCTCTGGTTCCATATCAGCTGCCTGACATTCAAGGCAGCCGCCACCCCCGCTCGTGTGACAGGAGGT 
GTCTTCAMGCCGCGGCACTGACTTTCGGTCGGGAAACTGTATTGGAATATAAGCAGGCCTTCACATTCTCC 
CCAACATACAAGTGA 



FIG.23C 
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HBV-HTL 

MGTSFVYVPSALNPADGPGPGLCQVFADATPTGWGLGPGPGRHYLHTLWKAGILYKGPGPGPHHTALRQAILC 
WGELMTLAGPGPGESRLVVDFSQFSRGNGPGPGPFLLAQFTSAICSVVGPGPGLVPFVQWFVGLSPTVGPGPG 
LHLYSHPIILGFRKIGPGPGSSNLSWLSLDVSAAFGPGPGLQSLTNLLSSNLSWLGPGPGAGFFLLTRILTIP 
QSGPGPGVS FGVW I RTPPAYRPPNAP I GPGPG VGP LTVNEKRRLKL I GPGPGKQC FRKLP VNRP I DWGPGPGA 
ANWILRGTSFVYVPGPGPGKQAFTFSPTYKAFLCGPGPGAKFVAAWTLKAAA 

ATGGGMCTTCTTTTGTGTATGTCCCTTCCGCTCTGAACCCAGCAGACGGACCCGGGCCTGGCCTGTGCCAGG 
TCTTCGCCGACGCAACTCCCACAGGGTGGGGGCTGGGGCCAGGACCAGGCAGGCACTACCTGCATACTCTGTG 
GAAGGCAGGAATCCTCTATAAAGGGCCCGGCCCAGGCCCTCACCACACCGCCCTGAGGCAGGCCATCCTGTGC 
TGGGGGGAGCTCATGACCCTGGCCGGACCTGGACCCGGGGAGAGCAGACTGGTGGTGGATTTCAGCCAATTCA 
GCAGAGGAMCGGACCCGGCCCTGGGCCTTTTCTGCTGGCTCAGTTTACATCTGCTATTTGTTCTGTGGTCGG 
CCCCGGGCCCGGACTCGTGCCTTTCGTGCAGTGGTTCGTGGGACTGTCCCCTACAGTCGGGCCCGGCCCAGGG 
CTGCATCTGTACTCCCACCCAATCATCCTCGGCTTCCGCAAGATTGGACCCGGCCCAGGCTCCAGCAATCTCT 
CCTGGCTCTCTCTGGACGTGTCTGCCGCCTTTGGCCCTGGACCAGGCCTGCAAAGCCTGACTAATCTGCTCAG 
eAGCAACCTGTCCTGGCTGGGACCTGGCCCAGGGGCTGGCTTCTTTCTGCTCACCCGGATTCTCACAATTCCC 
CAGTCCGGACCAGGACCAGGAGTCAGTTTCGGGGTGTGGATCAGGACCCCTCCTGCTTATAGACCACCCAATG 
CTCCAATCGGCCCCGGCCCTGGCGTCGGGCCACTGACCGTGAATGAGAAGCGCCGGCTGAAGCTGATCGGCCC 
TGGCCCTGGCMGCAGTGCTTTCGCAAACTGCCCGTGAACAGACCTATTGATTGGGGCCCCGGCCCTGGAGCA 
GCCMCTGGATTCTCAGGGGMCMGCTTCGTCTACGTGCCCGGGCCCGGACCAGGGMGCAGGCTTTTACCT 
TCTCTCCCACTTACAAGGCCTTCCTCTGTGGGCCAGGCCCCGGCGCCAAGTTTGTGGCAGCATGGACCCTCAA 
AGCCGCTGCCTGA 



FIG.24C 
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